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Atflp aequence totowhg Ser*°° and occurs within 
the dome* of Arflp tat shows homology **» hOE 
(74). To delete the compete STE23 sequence and 
crMmestBe3AsiA4driUation. poynerase chain 
reaction (PG R) pnma a (5'-TCG GAAQACCTCAT- 
TUrrGCTCATTTTGATATTGCTO- TGTAGATTG- 
TACTGAGA6TGCAC-3'; and 5'-OCTACAAACAQC- 

gtcgacttgaatgccccgacatcttcqactgt. 

GCG6TATTTCACA00G-3') were usad to ernpfty 
the LRA3 sequence of DRS316. and tie reaction 
product was H ana Juri im J Into yeast tor one-step gene 
iep td n*r»< P- rtothstain. Methods Smyrna. 194. 
281 (1 991 ]). To craate theatf / 0/.-L5U2 nutation corv 
taned on p114, a 5jCMtt> Sal I fragment from pAXLf 
was doned Ho pUCl9. and an internal 4.0-kb Hpa 
MhollraornenlMesreptexdv^aLEl^fragm^ 
To construct the ttn?3*~LEU2 a*ete <• databon cor- 
le ap ondng to 931 amino adds) canted on pi5& a 
iHJB sagrnant was used to repftac* the 2.8-Kb Prri 
l-Edl 36 1 fnjgment of S7EZ3. which occurs within a 
&2*tt> Hnd W-Bot B penomfc fragment carried on 
pSP72 (PrornesB). To create YEpMFM. a 1.6-kb 
Bam HI fragment containing UFA 1 t *om pKKl6 *C 
Kuchtar. R E, Sterne. J. Thorner. EMBOl 8. 3973 
ffflesj, was Igated into the Bam HI site of >Ep351 p. 
E. Hi, A. M. Myen. T.J. Koemer, A Tzagdoff. VtasT 
Z1630988S. 

24. J. Chant and L Hgrskowrtz. CeO 65. 1203(1991). 

25. B. W. Matthews, Act Cham. Ass. 21. 333 (1986). 

26. K. Kuchtar. H. G. DoHrnan, J. Thorner; J. CetBtoL 
120. 1203 (1993);aKoingand C P.HoBenbwg, 
EMBO J, 13. 3281 (1994); C. BerKower, 0. Loayza. 
S. Mfcrtaefis. A4ot Oof. Cats. 1 185 0994). 

27. A. Bander end J. R. Pringle.Aroc. NatLAcati. Set 
USA B6. 8976 (1989); J. Chant. K. Corrado. J. R. 
Prinpte, I. Herskowitz. Car 65. 1213 (1991); S. 
Powers, E Gonzales. T. Chrisiensen. J. Cuban, 0. 
Broek, flXtf., p. 1225; H. O. Park. J. Chant, I. Her- 
skowttz. Nature 365. 269 (1993); J- Chant. Trends 

Genet 10, 328 (1994); and J. R Prtngte. J. 

Cef Bo/. 129. 751 0995); JL Chant. M. Msehke, E. 
MitcneO. L Herskowitz. J. R. Pringle. OAT., p. 767. 

28. a F. Sprague Jr. Methods. Enymoi 194. 77 
099U 

29. Single ta tter a b breviation s for the amino add rasi- 
dues are as loflows: A, Ate; C. Cya; U Asp; E. GJu: F, 
Pho: a Qfy; K His: I. le; K, Lys; L. Leu; M. Met; N. 
Asa- P. Pro; Q. Gn- R Atk S. Ser. T. Thr; V, Vat; W, 
TrpiandY.Tyr. 

30. A W303 1A demote. SY2625 QMTm u«3-f leu2-3. 
n2frpMada?-f ani-i00ssti*mta2A.'jilSi-tacZ 

search. SY2625 o^tnbi^ tr ^ rnatrig aaaeya, ae- 
owd p herointji a) assays, and toe putee-ohasee^er* 
iments hduded tie fcaowhg stains: Y49 (sta2?-7). 
Y115 (fnte7A.-±aB). Y142 Ipdh-VWl. Y173 
(arfJAsLHJCZ). Y220 (axttrUX? «a23A.T(jR4JJ, Y221 
(sta£3a.-.'fJR49, Y231 (arfJasiatf stB23a^LBJS). 
and Y233 (sfe23&iaxa. MA fa ovmstiwjs of 
SY2625 friduded the totawng strains: Y199 
(SY2625 made MA To), Y278 Wa22W), Y195 
WalbzLEUZl. Y196 (w»A::t£U3. and Y197 
luflrW). The B3123 (W47a teu2 ua3 op r can? 
/v^geneticba^groundwBsusadtocreatsaoetor 
strains tor analysis of bud site selection. EQ123 de- 
rfcattvas included the totowtng strains: Y175 
(wrffArLfUE), Y223 WVJJRAQ. Y234 (sta23As 
IHJSI and Y272 \tx!1L:LEU2 sta23L':LEUZL 
MATo dartvalives of BG123 hduded the tc4owina 
strains: Y214 (EG123 made A44To) and Y293 
{fixntrlBJZi. Afl strains ware generated by means 
ot standard genetic or mdeeular metfiods tivoMng 
the appropriate constucts (23). In particular, 
ste23 double mutant straha ware created by cross- 
ing oi the appropriate MATa sn23 and MATo ax/T 
mutants, followed by sporuiationdtheresutartefp- 
loid and isolation of the double mutant from nonpe* 
rental d-type tetrads. Gene dsrupttons were con- 
firmed with either PCP. or Southern (DMA) analysis. 
31. pl29 is a YEp352 (JL E hB. A M Myers, T. J. Ko- 
emer. A. TzagdofT, Keasr 2, 163 (1966)) ptenid corv 
tartngaS^^Satl tagrnantof DAXL/.P151 was 
deriwjd torn pi 29 by Insertion d • Wwr at the Ball 
site wUhhAXLf. which tod to an frvtame haertionof 
the hemaggUWn frW) epitope <TX71ffWVPOYA)(2B) 
between arrano adds 854 and 655 of the AXL 1 prod- 



uct pC225 be KS+ (StrataganeJptasrTm^catahsu 
a034cbBamH^Ss< I taQmorttu><pA>l?. Subsb- 
tuobn rruatJons of the proposed active sta of Arfip 
««^Batod with the use of pC225 and dUKtpodfc 
f nJagsn assirMifWigaprjrDp^ 
ctoOWBS {SAfl-MSSA. 5'-GTGCTCACAAAGCGCT- 
GCC^AACCGGC-3'; *tff-£7M, S'-AAGAATCAT- 
GTGOSCACMAGGTGCG&3'; and wtf1-€7lD, 5'- 
AAGMTCATGTGATCACAAAGGTGCGC-3'). The 
nnutatiom wara confirmed by aeoAi^ 
ter* mutagenesis, the 0.4-kb Bam hS-Msc I fragment 
from the rruugenfeed pC225 ptasrrtds was trans, 
farrad nto p4H.f to create a set of pRS3l 6 rjtasrrids 
carrying oftarent AXL 7 eastas. pi 24 (fixn-H6&Al 
piso 1vn-€7iA). and pi32 (a*f;-f7;0). Smtoty. a 
sat of HA^agged afelas earned on Y&»352 ware cn> 
Btedrter re p teL « ii i ei a of the p!5l Bam W-Msc I 
fragment to generate pi 61 U^7-£7tA), pi62 (arfT- 



J^Oavis. T. Pavaro. C. at Hoog. and S. Kan tor 
cornmems on the manuscript Supported by a 
grant to Cfi. from the Natural Sdencas and Eno> 
nearing Research Coundi of Canada. Support to 
M.N A. was from a Caitornia Tobacco ndtedDa- 

S™ST* Pre8nm °°** K ~ m ***** 

22 June 1995: accepted 21 August 1995 



Quantitative Monitoring of Gene Expression 
Patterns with a Complementary DNA Microarray 

Mark Schena,* Dari Shalon^t Ronald W. Davis, 
Patrick O. Brown* 

A high-capaclty system was developed to monitor the expression of many genes In 
parallel. M.croarrays prepared by high-speed robotic printing of complementary DMAs on 
glass were used for quantitative expression measurements of the corresponding oen.es. 
Because of the small format and high density of the arrays, hybfidizatJon volumes of 2 
C0 " W be used that enabled detection of rare transcripts In probe mixtures 
derived from 2 micrograms of total cellular messenger RNA. Differentia! expression 
measurements of 45 Arabidopsis genes were made by means of simultaneous, two-color 
fluorescence hybridization. *"™™™««, Two-coior 



The remporal, developmental, topographi- 
cal, histological, and physiological patterns 
in which a gene is expressed provide clues to 
its biological role. The large and expanding 
database of complementary DNA (cDNA) 
sequences from many organisms (J) presents 
the opportunity of defining these patterns at 
the level of the whole genome. 

For these studies, we used the small flow, 
ering plant Arabidopsis ihaliana as a model 
organism. Arabidopsis possesses many ad- 
vantages for gene expression analysis, in- 
cluding the fact that it has the smallest 
genome of any- higher eukaryote examined 
to date (2). Forty-five cloned Arabidopsis 
cDNAs (Table 1), including 14 complete 
sequences and 31 expressed sequence tags 
(ESTs), were used as gene-specific targets. 
We obtained the ESTs by selecting cDNA 
clones at random from an Arabidopsis 
cDNA library. Sequence analysis revealed 
that 28 of the 31 ESTs matched sequences 
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in the database (Table I). Three additional 
cDNAs from other organisms served as con- 
trols in the experiments. 

The 48 cDNAs, averaging -1.0 kb, 
were amplified with the polymerase chain 
reaction (PCR) and deposited into indi- 
vidual wells of a 96-well microliter plate. 
Each sample was duplicated in two adja- 
cent wells to allow the reproducibility of 
the arraying and hybridization process to 
be tested. Samples from the microliter 
plate were printed onto glass microscope 
slides in an area measuring 3.5 mm by 5.5 
mm with the use of a high-speed arraying 
machine (3). The arrays were processed by 
chemical and heat treatment to attach the 
DNA sequences to the glass surface and 
denature them (3). Three arrays, printed 
in a single lot, were used for the experi- 
ments here. A single microttter plate of 
PCR products provides sufficient material 
to print at least 500 arrays. 

Fluorescent probes were prepared from 
total Arabidopsis mRNA (4) by a single 
round of reverse transcription (5). The Ara- 
bidopju mRNA was supplemented with hu- 
man acetylcholine receptor (AChR) mRNA 
at a dilution of 1 : 10.000 (w/w) before cDNA 
synthesis, to provide an internal standard for 
calibration (5). The resulting fluorescently 
labeled cDNA mixture was hybridized to an 
array at high stringency (6) and scanned 
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with a laser (3). A high-sensiriviry scan gave 
signals that saturated the detector at nearly 
all of the Arobidopsis target sites (Fig. 1A). 
Calibration relative to die AChR mRNA 
standard (Fig. IA) established a sensitivity 
limit of — 1 : 50,000. No detectable hybridiza- 
tion was observed to either the rat glucocor- 
ticoid receptor (Fig. IA) or the yeast TRP4 
(Fig. IA) targets even at the highest scan- 
ning sensitivity. A moderate-sensinvity scan 



of the same array allowed linear detection of 
the more abundant transcripts (Fig. IB). 
Quantitation of both scans revealed a range 
of expression levels spanning three orders of 
magnitude for the 45 genes tested (Table 2). 
RNA blots (7) for several genes (fig. 2) 
corroborated the expression levels measured 
with the microarray to within a factor of 5 
(Table 2). 

Differential gene expression was investi- 
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gated with a simultaneous, two-color hy- 
bridisation scheme, which served to mini- 
mixe experimental variation inherent in the 
comparison of independent hybridizations. 
Fluorescent probes were prepared from two 
mRNA sources with the use of reverse tran* 
scriptase in the presence of fluorescein- and 
lissarnine-labeled nucleotide analogs, re- 
spectively (5). The two probes were then 
mixed together in equal proportions, hy- . 
bridized to a single array, and scanned sep- 
arately for fluorescein and lissamine emis- 
sion after independent excitation of the two 
fluorophores (3). 

To test whether overexpression of a sin- 
gle gene could be detected in a pool of total 
Arobidopsis mRNA, we used a microarray to 
analyze a transgenic line overexpressing the 
single transcription factor HAT4 («). Fluo- 
rescent probes representing mRNA from 
wild-type and HAT4-transgenic plants were 
labeled with fluorescein and lissamine, re- 
spectively; the two probes were then mixed 
and hybridized to a single array. An intense 
hybridization signal was observed at the 
position of the HAT4 cDNA in the lissa- 
mine-specific scan (Fig. ID), but not in the 
fluorescein-specific scan of die same array 
(Fig. IC). Calibration with AChR mRNA 
added to the fluorescein and lissamine 
cDNA synthesis reactions at dilutions of 
1:10,000 (Fig. 1C) and 1:100 (Fig. ID), 
respectively, revealed a 50-fold elevation of 
HAT4 mRNA in the transgenic line rela- 
tive to its abundance in wild-rype piano 
(Table 2). This magnitude of HAT* over- 
expression matched that inferred from the 
Northern (RNA) analysis within a factor of 
2 (Fig. 2 and Table 2). Expression of ail the 
other genes monitored on the array differed 
by less than a factor of 5 between HAT4- 
transgenic and wild-rype piano (Fig 1, C 
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Flo. 1. Gene expression nx»nrt()rBd wim tn^ 

ps^ioocokycorresp^ 

^th*u»frfkn«^ 

person tre axes ma^ the posmon of each c^ 

wift fhjc)fesce^abeled cDNA derived from wild-type plants. (B) Same array as in (A) out scanned at 
moderate sensttrvtty. (C end D) A srtgte a/ray was probed with a 1 :1 mixture of fkx>resceir)-tabeiad cONA 
w^rype plants and fesaminejabeted cONA from HAT4-transgen»c plants. The single array was 

plants (Q end the Issarrtne fluorescence corresponding to mRNA from HAT4-transger»c plants ID) (E 
and F) A ^>f anay was probed wfth a 1:1 mixture of ftuorescetn-labeied cDNA from roottissue and 
Sssanro-tabeted cDNA from leal tissue. The single array was then scanned success™* to deled ihe 
fluoresce* florescence comssponrfng to mRNAs expressed m roots (E) and the issamine fluorescence 
wrresponoVig to mRNAs expressed n leaves (F). w« "uorocence 
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IX 0.1 O01 1.0 Q.1 OOi 
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AChR 



20 2-0 0.2 
mRNA (no) 

Rg. 2. Gene expression monitored with RNA 
(Northern) blot analysis. Designated amounts of 
mRNA from wild-type and HA74-transgenc 
plants were spotted onto nylon membranes and 
probed with the cONAs Indicated. Purified human 
AChR mRNA was used tor calibration. 
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and D, and Table 2). Hybridisation of flu. 
orcscein-labeled glucocorticoid receptor 
cDNA {Fig, 1C) and liisamine-labeled 
TRP4 cDNA (Fig. ID) verified the pre*- 
ence of the negative control target! and the 
lack of optical crass talk between the two 
fluorophores. 

To explore a more complex alteration in 
expression patterns, we performed a second 
two-color hybridization experiment with 
fluorescein- and lissamine-labeled probes 
prepared from root and leaf mRNA, respec- 
tively. The scanning sensitivities for the 
two fluorophores were normalized by 
matching the signals resulting from AQiR 



mRNA, which was added to both cDNA 
synthesis reactions at a dilution of 1:1000 
(Fig. 1, E and F). A comparison of the scans 
revealed widespread differences in gene ex- 
pression between root and leaf tissue (Fig. 1, 
E and F). The mRNA from the light-regu- 
Uted CAB/ gene was -500-fold more abun- 
dant in leaf (Fig. IF) than in root tissue 
(Hg. IE). The expression of 26 other genes 
differed between root and leaf tissue by 
more than a factor of 5 (Fig. 1, E and F). 

The HAT4-transgenic line we examined 
has elongated hypocotyb, early flowering, 
poor germination, and altered pigmentation 
io). Although changes in expression were 

Table 1. Sequences contained on the cDNA microa/rav Shown is thp rw*i™ t^, 

function, and the accession number of each cONAintihe^ni^MrraWJ^T^ i^fn^!* i^L^***^ ? P^®^^ 5 
* this study matched a sequence in the ^^^T^^ l'r^ °' *? 6818 ^ 
*uidec*oe; ATPase. adeno^ne triphosphaSs^ ncot™r«de adenine 



Position 



cONA 



Function 



.12 



a1,2 
a3. 4 
aS. 6 
a7.8 
a9, 10 
all, 12 
D1.2 
03.4 
D5.6 
b7,8 
09, 10 
b11. 12 
CI. 2 
C3, 4 
C5.6 

C7.8 

CS. 10 

C11. 

d1.2 

d3.4 

65,6 

d7, 8 

d9, 10 

d11.12 

61.2 

e3 t 4 

eS.6 

67,8 

e9, 10 

611. 

M.2 

«3,4 

15,6 

f7,8 

f9. 10 

111,12 

01.2 
g3.4 
95.6 
07.8 
99, 10 
011. 12 
M,2 
h3.4 
h5,6 
h7,8 
h9, 10 
M1.12 



number 



. 12 



AChR 
EST3 
EST6 
AAC1 
EST12 
EST13 
CAB 
EST17 
GA4 
EST19 
GBF'1 
EST23 
EST29 
GBF-2 
EST34 
EST35 
EST41 
rGR 
EST42 
EST45 
HAT1 
EST46 
EST49 
HAT2 
HAT4 
EST50 
HA 75 
EST51 
HAT22 
ESTS2 
EST59 
KNATl 
EST60 
EST69 
PPH1 
EST70 
EST75 
EST 78 
ROCI 
EST82 
ESTB3 
EST84 
EST91 
EST96 
SARI 
EST100 
EST 103 
7RP4 



Human AChR 
Actin 

NADH ctehydrogenase 

Actin 1 

Unknown 

Actin 

ChtaophyUa/bbtifrxj 
Phospriogiycerate kinase 
Gtoberefijc add biosynthesis 
Untaown 

G-bc* binding factor 1 
Bongation factor 
Aldolase 

G-box binding factor 2 
Chtoropiast protease 
Unknown 



'Proprietary a 



Bat glucocorticoid receptor 
Unknown 
ATPase 

Homeobox-leucine zpper i 
Ligrrt harvesting compter 
Unknown 

Homeobca^eucine zipper 2 
Horrieobox-teucine zipper 4 
Ptosprxarirxjtotanase 
Honeotxjx-teucine rpper 5 
Unknown 

Hc<neobox-*eucine zipper 22 
Oxygen evolving 
Unknown 

Koofred-Eke homeobox 1 
RuBisCO smal subunit 
Translation elongation factor 
Protein phosphatase 1 
Unknown 

Chtoropiast protease 
Unknown 
Cydophin 
GTP binding 
Unknown 
Unknown 
unknown 
Unknown 
Syriaptobrevtn 
Light harvesting complex 
Ughl harvesting complex 

Veast tryp tophan biosynthesis 

vmctStntow^j^CMon*). tNo match in the database; rxvel EST. 



H36236 

227010 

M20016 

U36594T 

T4S783 

MB5150 

T44490 

L37126 

U3B595t 

X63894 

X52256 

T04477 

X63895 

R87034 

T14152 

T22720 

M14053 

U36596t 

J04185 

U09332 

T04063 

T76267 

U09335 

M90394 

T04344 

M90416 

233675 

U09336 

T21749 

234607 

U14174 

X14564 

T42799 

U34803 

T44621 

T43698 

R65481 

L14844 

X59152 

233795 

T45276 

T13832 

R64816 

NI90418 

218205 
X03909 
X04273 



observed for HAT4 t large changes in «. 
pression were not observed for any of the 
other 44 genes we examined This was 
somewhat surprising, particularly because 
comparative analysis of leaf and root tissue 
identified 27 differentially expressed genes. 
Analysis of an expanded act of genes may be 
required to identify genes whose expression 
changes upon HAT4 overexpression; alter, 
natively, a comparison of mRNA popula- 
twns from specific tissues of wild-type and 
HAT4-transgenic plants may allow identi- 
fication of downstream genes. 

At the current density of robotic printing 
it is feasible to scale up the fabrication pro- 
cess to produce arrays cexuaining 20,000 
cDNA targets. At this density, a single array 
would be sufficient to provide gene-specific 
targets encompassing nearly die entire rep- 
ertoire of expressed genes in the Ardxdopsis 
genome (2). The availability of 20,274 ESTs 
from Arabuiopsis (J, 9) would provide a rich 
source of templates for such studies. 

The estimated 100,000 genes in the hu- 
man genome (10) exceeds the number of 
Arabidopsis genes by a factor of 5 (2). This 
modest increase in complexity suggests that 
similar cDNA microarrays, prepared from 
the rapidly growing repertoire of human 
ESu (J), could be used to determine the 
expression patterns of tens of thousands of 
human genes in diverse cell types. Coupling 
an amplification strategy to the reverse 
transcription reaction (J I) could make it 
feasible to monitor expression even in 
minute tissue samples. A wide variety of 
acute and chronic physiological and patho- 
logical conditions might lead to character- 
istic changes in the patterns of gene expres- 
sion in peripheral blood cells or other easily 
sampled tissues. In concert with cDNA mi- 
croarrays for monitoring complex expres- 
sion patterns, these tissues might therefore 
serve as sensitive in vivo sensocs for clinical 
diagnosis. Microarrays of cDNAs could thus 
provide a useful link between human gene 
sequences and clinical medicine. 

Table 2. Gene expression rixxiitcringbyrrscw 
ray and RNAbtot analyses; tg, HA&tamrft: 
See Table 1 tor additional gene Worrnation^: 
presscn levels (w/w) were calibrated with the use 
ol Known amounts of human AChfl mRNA. Values 
lor the mrcroarrey were c^temwed from mfcroar - 
ray scans (Fig. 1); values tor the RNA btot were 
determined from RNA blots (Rg 2) 
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Expression teveJ (w/w) 
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Microarray 


RNAbtot 


CAB 
C4fl/(tpj 
HAT4 
HAT4 (tg) 
ROCI 
ROCI (tg) 


1:48 

1:120 

1:8300 

1:150 

1:1200 

1260 


1:83 

1:150 

1:6300 

1210 

1:1800 

1:1300 
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Gene Therapy in Peripheral Blood 
Lymphocytes and Bone Marrow for 
ADA Immunodeficient Patients 

Claudio Bordignon,* Luigi D. Notarangelo, Nadia Nobili, 
Qiuhana Ferrari, Giulia Casorati, Paola Panina, Evelina Mazzolari 
Daniela Maggioni, Claudia Rossi, Paolo Seivida, 
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toe^Z^?™^^ «*!* mnc y results ir » severe combined Immunodefciency, 
^J^™ C ti * or6 "J**«> ^ gene therapy. Two different retroviral vectors We 
used to transfer ex vivo the human ADA minlgene into bone marrow celts anoWphen^ 

f«H ~! . y . °^ treatment, long-term survival of T and B lymphocytes, marrow cells. 
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t ™ [y/nphocytes, were progressively replaced by marrow-derived T cells in both pa- 
tients. These results indicate successful gene transfer into long-lasting progenitor ceHs 
producing a functional multilineage progeny. progenitor ceils, 



Severe combined immunodeficiency asso- 
ciated with inherited deficiency of ADA 
(]) is usually fetal unless affected children 
are kept in protective isolation or the im- 
mune system is reconstituted by bone mar- 
row transplantation from a human leuko- 
cyte antigen (HLAHdentical sibling donor 
{21 This is the therapy of choice, although 
it is available only for a minority of patients. 
In recent yean, other forms of therapy have 
been developed, including transplants from 
haploidentical donors (3,4), exogenous cn- 
ryme replacement (5), and somatic-cell 
gene therapy (6-9). 

We previously reported a preclinical mod- 
el in which ADA gene transfer and expression 
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Miccessfully restored immune functions in hu- 
man ADA-deficient (ADA") peripheral 
blood lymphocytes (PBLs) in immunodefi- 
cient mice in vivo (J0 f I J J. On the basis of 
these preclinical results, the clinical appltca- 
lion of gene oSerapy for the treatment of 
ADA" SOD (severe combined rnimurodefi- 
ciency disease) patients who previously failed 
exogenous enxyme replacement therapy was 
approved by our Institutional Ethical Com- 
mittees and by the Italian National Commit- 
tee for Bioethics (12). In addition to evaluat- 
ing the safety and efficacy of the gene therapy 
procedure, the aim of the study was to define 
. the relative role of PBLs and hematopoietic 
stem cells in the long-term reconsritution of 
immune functions after retroviral vector-me- 
diated ADA gene transfer. For this purpose, 
two structurally identical vectors expressing 
the human ADA complementary DNA 
(cDNA), distinguishable by the presence of 
alternative restriction sites in a nonfunctional 
region of the viral long-terminal repeat 
(LTR), were used to transduce PBLs and bone 
marrow (BM) cells independently. This pro- 
cedure allowed identification of the origin of 
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ABSTRACT _ cDNA microarray technology is used to profile 
complex diseases and discover novel disease-related genes. In 
inflammatory, disease such as rheumatoid arthritis, expression 
patterns of; diverse cell types contribute to the pathology. We 
have monitored gene expression in this disease state with a 
microarray of selected human genes of probable significance in 
inflammation as well as with genes expressed in peripheral 
human blood cells. Messenger RNA from cultured macrophages, 
chondrocyte cell, lines, primary chondrocytes, and synoviocytes 
provided expression profiles for the selected cytokines, chemo- 
kines, DNA binding proteins, and matrix-degrading metal- 
loproteinases. Comparisons between tissue samples of rheuma- 
toid arthritis and inflammatory bowel disease verified the in- 
volvement of many genes and revealed, novel participation of the 
cytokine interleukin 3, chemokine Groa and the metal- 
loproteinase matrix metallo-elastase in both diseases. From the 
peripheral blood library, tissue inhibitor of metalloproteinase 1, 
ferritin light chain, and manganese superoxide dismutase genes 
were identified as expressed differentially in rheumatoid arthri- 
tis compared with inflammatory bowel disease. These results 
successfully demonstrate the use of the cDNA microarray system 
as a general approach for dissecting human diseases. ' 

The recently described cDNA microarray or DNA-chip tech- 
nology allows expression monitoring of hundreds and thou- 
sands of genes simultaneously and provides a' format for 
identifying genes as well as changes in their, activity (1, 2). 
Using this technology, two-color fluorescence patterns of 
differential gene expression in the root versus the shoot tissue 
of Arabidopsis were obtained in a specific array of 48 genes (1). 
In another study, using a 1000 gene array from a human 
peripheral blood library, novel genes expressed by T cells were 
identified upon heat shock and protein kinase C activation (3). 

The technology uses cDNA sequences or cDNA inserts of a 
library for PCR amplification that are arrayed on a glass slide with 
high speed robotics at a density of 1000 cDNA sequences per cm 2 . 
These microarrays serve as gene targets for hybridization to 
cDNA probes prepared from RNA samples of cells or tissues. A 
two-color fluorescence labeling technique is used in the prepa- 
ration of the cDNA probes such that a simultaneous hybridization 
but separate detection of signals provides the comparative anal- 
ysis and the relative abundance of specific genes expressed (1, 2). 
Microarrays can be constructed from specific cDNA clones of 
interest, a cDNA library, or a select number of open reading 
frames from a genome sequencing database to allow a large-scale 
functional analysis of expressed sequences. 
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Because (Of.thcj wide spectrum of genes, and endogenous 
mediators involved, the microarray . technology", is well suited 
for analyzing chronic 'diseases. In rheumatoid arthritis (RA), 
inflammation of the joint is caused by the gene products of 
many different cell types present in the synovium and cartilage 
tissues plus; those 1 infiltrating from the circulating blood.. The 
autoimmune arid- inflammatory nature of the" disease is a 
cumulative result of genetic susceptibility factors and multiple 
respohses, ' paracrine and autocrine in ! nature, from 1 macro- 
phages, T cells, plasma cells, neutrophils, synovial fibroblasts, 
chondrocytes, etc.' Growth factors, inflammatory cytokines 
(4), arid the chemokines (5) are the important mediators' of this 
inflammatory process! The ensuing destruction of the cartilage 
and bone^by the invading synovial tissue includes the actions 
of prostaglandins and leukotrienes (6), and the matrix degrad- 
ing metalloproteinases (MMPs). The MMPs are an important 
class of Zn-dependent metallo-endoproteinases that can col- 
lectively degrade the proteogrycanand collagen components of 
the connective tissue -matrix (7). 

This paper presents a study in which the involvement of 
select classes of molecules in RA was examined. Also inves- 
tigated were. 1000 human genes randomly selected from a 
peripheral human blood cell library. Their differential and 
quantitative; expression analysis in cells of the joint tissue, in 
diseased. R A tissue, and in inflammatory bowel disease (IBD) 
tissues was conducted to demonstrate the utility of the mi- 
croarray method to analyze complex diseases by their pattern 
of gene expression. Such a survey provides insight not only into 
the underlying cause of the pathology, but also provides the 
opportunity to selectively target genes for disease intervention 
by appropriate drug development and gene therapies. 

METHODS 

Microarray Design, Development, and Preparation. Two ap- 
proaches for the. fabrication of cDNA microarrays were used in 
this study. In the first approach, known human genes of probable 
significance in R A were identified. Regions of the clones, pref- 
erably 1 kb in length, were selected by their proximity to the 3' end 
of the cDNA and for areas of least identity to related and 
repetitive sequences. Primers were synthesized to amplify the 
target regions by standard PCR protocols (3). Products were 



Abbreviations: RA, rheumatoid arthritis; MMP, matrix-degrading 
metalloproteinase; IBD, inflammatory bowel disease; LPS, I ipo poly- 
saccharide; PMA, phorbol 12-myristate 13-acetate; TNF-a, tumor 
necrosis factor a; IL, interleukin; TGF-jS, transforming growth factor 
0; GCSF, granulocyte colony-stimulating factor; MIP, macrophage 
inflammatory protein; MIF, migration inhibitory factor; HME, human 
matrix metallo-elastase; RANTES, regulated upon activation, normal 
T cell expressed and secreted; Gel, gelatinase; VCAM, vascular cell 
adhesion molecule; ICE, IL-1 converting enzyme; PUMP, putative 
metalloproteinase; MnSOD, manganese superoxide dismutase; TIMP, 
tissue inhibitor of metalloproteinase; MCP, macrophage chemotactic 
protein. 
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verified by gel electrophoresis and purified with Qiaquick 96-well 
purification kit (Qiagen, Chatsworth, CA), hyophilized (Savant), 
and resuspended in 5 /il of 3 X standard saline citrate (SSC) buffer 
for arraying. In the second approach, the microarray containing 
the 1056 human genes from the peripheral blood lymphocyte 
library was prepared as described (3). 

Tissue Specimens. Rheumatoid synovial tissue was obtained 
from patients with late stage classic RA undergoing remedial 
synovectomy or arthroplasty of the knee. Synovial tissue was 
separated from any associated connective tissue or fat. One 
gram of each synovial specimen was subjected to RNA extrac- 
tion within 40 min of surgical excision, or explants were 
cultured in serum-free medium to examine any changes under 
in vitro conditions. For IBD, specimens of macroscopically 
inflamed lower intestinal mucosa were obtained from patients 
with Crohn disease undergoing remedial surgery. The hyper- 
trophied mucosal tissue was separated from underlying con- 
nective tissue and extracted for RNA. 

Cultured Cells. The Mono Mac-6 (MM6) monocytic cells 
(8) were grown in RPMI medium. Human chondrosarcoma 
SW1353 cells, primary human chondrocytes, and synoviocytes 
(9, 10) were cultured in DMEM; all culture media were 
supplemented with 10% fetal bovine serum, 100 /utg/ml strep- 
tomycin, and 500 units/ml penicillin. Treatment of cells with 
lipopolysaccharide (LPS) endotoxin at 30 ng/ml, phorbol 
12-myristate 13-acetate (PMA) at 50 ng/ml, tumor necrosis 
factor a (TNF-a) at 50 ng/ml, interleukin (IL)-lj3 at 30 ng/ml, 
or transforming growth factor-j3 (TGF-J3) at 100 ng/ml is 
described in the figure legends. 



Fluorescent Probe, Hybridization, and Scanning. Isolation of 
mRNA, probe preparation, and quantitation with Arabidopsis 
control mRNAs was essentially as described (3) except for the 
following minor modification. Following the reverse transcriptase 
step, the appropriate Cy3- and Cy5-labeled samples were pooled; 
mRNA degraded by heating the sample to 65°C for 10 min with 
the addition of 5 /xl of 05M NaOH plus 0.5 ml of 10 mM EDTA. 
The pooled cDNA was purified from unincorporated nucleotides 
by gel filtration in Centri-spin columns (Princeton Separations, 
Adelphia, NJ). Samples were lyophilized and dissolved in 6 /tl of 
hybridization buffer (5x SSC plus 02% SDS). Hybridizations, 
washes, scanning, quantitation procedures, and pseudocolor rep- 
resentations of fluorescent images have been described (3). Scans 
for the two fluorescent probes were normalized either to the 
fluorescence intensity of Arabidopsis mRNAs spiked into the 
labeling reactions (see Figs. 2-4) or to the signal intensity of 
j3-actin and glyceraldehyde-3-phosphate dehydrogenase 
(GAPDH; see Fig. 5). 

RESULTS 

Ninety-Six-Gene Microarray Design. The actions of cytokines, 
growth factors, chemokines, transcription factors, MMPs, pros- 
taglandins, and leukotrienes are well recognized in inflammatory 
disease, particularly R A (1 1-14). Fig. 1 displays the selected genes 
for this study and also includes control cDNAs of housekeeping 
genes such as )3-actin and GAPDH and genes from Arabidopsis 
for signal normalization and quantitation (row A, columns 1-12). 

Defining Microarray Assay Conditions. Different lengths and 
concentrations of target DNA were tested by arraying PCR- 
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amplified products ranging from 0.2 to 1.2 kb at concentrations 
of 1 iLg/iti or less. No significant difference in the signal levels was 
observed within this range of target size and only with 0.2-kb 
length was a signal reduced upon an 8-fold dilution of the 1 /ng/pd 
sample (data not shown). In this study the average length of the 
targets was 1 kb, with a few exceptions in the range of <*300 bp, 
arrayed at a concentration of 1 /ig/pd. Normally one PCR pro- 
vided sufficient material to fabricate up to 1000 microarray targets. 

In considering positional effects in the development of the 
targets for the microarrays, selection was biased toward the 3' 
proximal regions, because the signal was reduced if the target 
fragment was biased toward the 5' end (data not shown). This 
result was anticipated since the hybridizing probe is prepared by 
reverse transcription with oligo(dT)-primed mRNA and is richer 
in 3' proximal sequences. Cross-hybridizations of probes to 
targets of a gene family were analyzed with the matrix metal- 

A. 

uniiulucocl 



loproteinases as the example because they can show regions of 
sequence identities of greater than 70%. With collagenase-1 
(Col-1) and coUagenase-2 (Col-2) genes as targets with up to 70% 
sequence identity, and stromerysin-1 (Strom-1) and stromerysin-2 
(Strom-2) genes with different degrees of identity, our results 
showed that a short region of overlap, even with 70-90% se- 
quence identity, produced a low level of cross-hybridization. 
However, shorter regions of identity spread over the length of the 
target resulted in cross-hybridization (data not shown). For 
closely related genes, targets were designed by avoiding long 
stretches of homology. For members of a gene family two or more 
target regions were included to discriminate between specificity 
of signal versus cross-hybridization. 

Monitoring Differential Expression in Cultured Cell Lines. In 
RA tissue, the monocyte/macrophage population plays a prom- 
inent role in phagocytic and immunomodulatory activities. Typ- 
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Fig. 2. Time course for LPS/PMA-induced MM6 cells. Array elements are described in Fig. 1. (A) Pseudocolor representations of fluorescent 
scans correspond to gene expression levels at each time point. The array is made up of SArabidopsis control targets and 86 human cDNA targets, 
the majority of which are genes with known or suspected involvement in inflammation. The color bars provide a comparative calibration scale 
between arrays and are derived from the Arabidopsis mRNA samples that are introduced in equal amounts during probe preparation. Fluorescent 
probes were made by labeling mRNA from untreated MM6 cells or LPS and PM A treated cells. mRNA was isolated at indicated times after 
induction. (B /-///) The two-color samples were cohybridized, and microarray scans provided the data for the levels of select transcripts at different 
time points relative to abundance at time zero. The analysis was performed using normalized data collected from 8-bit images. 
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ically these cells, when triggered by an immunogen, produce the 
proinflammatroy cytokines TNF and IL-1. We have used the 
monocyte cell line MM6 and monitored changes in gene expres- 
sion upon activation with LPS endotoxin, a component of Gram- 
negative bacterial membranes, and PMA, which augments the 
action of LPS on TNF production (15). RNA was isolated at 
different times after induction and used for cDNA probe prep- 
aration. From this time course it was clear that TNF expression 
was induced within 15 min of treatment, reached maximum levels 
in 1 hr, remained high until 4 hr and subsequently declined (Fig. 
24). Many other cytokine genes were also transiently activated, 
such as IL-la and -ft IL-6, and granulocyte colony-stimulating 
factor (GCSF). Prominent chemokines activated were H^8, mac- 
rophage inflammatory protein (MIP)-lj3, more so than MlP-la, 
and Groa or melanoma growth stimulatory factor. Migration 
inhibitory factor (MIF) expressed in the uninduced state declined 
in LPS-activated cells. Of the immediate early genes, the notice- 
able ones were c-fos t fra-l, c-jun, NF-*Bp50, and IkB, with c~rei 
expression observed even in the uninduced state (Fig. 2B). These 
expression patterns are consistent with reported patterns of 
activation of certain LPS- and PMA-induced genes (12). Dem- 
onstrated here is the unique ability of this system to allow parallel 
visualization of a large number of gene activities over a period of 
time. 

SW1353 cells is a line derived from malignant tumors of the 
cartilage and behaves much like the chondrocytes upon stim- 
ulation with TNF and IL-1 in the expression of MMPs (9). In 
addition to confirming our earlier observations with Northern 
blots on Strom-1, Col-1, and Col-3 expression (9), gelatinase 
(Gel) A, putative metalloproteinase (PUMP)-l membrane- 





type matrix metalloproteinase, tissue inhibitors of matrix 
metalloproteinases or tissue inhibitor of metalloproteinase 1 
(TIMP-1), -2, and -3 were also expressed by these cells together 
with the human matrix metallo-elastase (HME; Fig. 3A). HME 
induction was estimated to be ~50-fold and was greater than 
any of the other MMPs examined (Fig. 3B). This result was 
unexpected because HME is reportedly expressed only by 
alveolar macrophage and placental cells (16). Expression of 
the cytokines and chemokines, IL-6, IL-8, MIF, and MIP-10 
was also noted. A variety of other genes, including certain 
transcription factors, were also up-regulated (Fig. 3), but the 
overall time-dependent expression of genes in the SW1353 
cells was qualitatively distinct from the MM6 cells. 

Quantitation of differential gene expression (Figs. IB and 
3B) was achieved with the simultaneous hybridization of 
Cy3-labeled cDNA from untreated cells and Cy5-labeled 
cDNA from treated samples. The estimated increases in 
expression from these microarrays for a select number of genes 
including IL-1/3, IL-8, MIP-1/3, TNF, HME, Col-1, Col-3, 
Strom-1, and Strom-2 were compared with data collected from 
dot blot analysis. Results (not shown) were in close agreement 
and confirmed our earlier observations on the use of the 
microarray method for the quantitation of gene expression (3). 

Expression Profiles in Primary Chondrocytes and Synovio- 
cytes of Human RA Tissue. Given the sensitivity and the 
specificity of this method, expression profiles of primary 
synoviocytes and chondrocytes from diseased tissue were 
examined. Without prior exposure to inducing agents, low level 
expression of c-jun, GCSF, IL-3, TNF-/3, MIF, and R ANTES 
(regulated upon activation, normal T cell expressed and se- 
creted) was seen as well as expression of MMPs, GelA, 
Strom-1, Col-1, and the three TIMPs. In this case, Col-2 
hybridization was considered to be nonspecific because the 
second Col-2 target taken from the 3' end of the gene gave no 

A. Human synovial fibroblasts B. Human articular chondrocytes 
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Fig. 3. Time course for IL-1/3 and TNF-induced SW1353 cells 
using the inflammation array (Fig. 1). (A) Pseudocolor representation 
of fluorescent scans correspond to gene expression levels at each time 
point. (B I-IV) Relative levels of selected genes at different time points 
compared with time zero. 



Fig. 4. Expression profiles for early passage primary synoviocytes and 
chondrocytes isolated from RA tissue, cultured in the presence of 10% 
fetal calf serum and activated with PMA and IL-10, or TNF and IL-1/3, 
or TGF-/3 for 18 hr. The color bars provide a comparative calibration scale 
between arrays and are derived from Xh&Arabidopsis mRNA samples that 
are introduced in equal amounts during probe preparation 
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signal. Treatment more so with PMA and IL-1, than TNF and 
IL-1, produced a dramatic up-regulation in expression of 
several genes in both of these primary cell types. These genes 
are as follows: the cytokine IL-6, the chemokines IL-8 and 
Gro-la, and the MMPs; Strom-1, Col-1, Col-3, and HME; and 
the adhesion molecule, vascular cell adhesion molecule 1 
(VCAM-1). The surprise again is HME expression in these 
primary cells, for reasons discussed above. From these results, 
the expression profiles of synoviocytes and the chondrocytes 
appear very similar; the differences are more quantitative than 
qualitative. Treatment of the primary chondrocytes with the 
anabolic growth factor TGF-j3 had an interesting profile in that 
it produced a remarkable down-regulation of genes expressed 
in both the untreated and induced state (Fig. 4). 

Given the demonstrated effectiveness of this technology, a 
comparative analysis of two different inflammatory disease 
states was conducted with probes made from RA tissue and 
IBD samples. RA samples were from late stage rheumatoid 
synovial tissue, and IBD specimens were obtained from in- 
flamed lower intestinal mucosa of patients with Crohn disease. 
With both the 96-element known gene microarray and the 
1000-gene microarray of cDNAs selected from a peripheral 
human blood cell library (3), distinct differences in gene 
expression patterns were evident. On the 96-gene array, RA 
tissue samples from different affected individuals gave similar 
profiles (data not shown) as did different samples from the 
same individual (Fig. 5). These patterns were notably similar 
to those observed with primary synoviocytes and chondrocytes 
(Fig. 4). Included in the list of prominently up-regulated genes 
are IL-6, the MMPs Strom-1, Col-1, GelA, HME, and in 
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Fig. 5. Expression profiles of RA tissue (A) and IBD tissue (B). 
mRNA from RA tissue samples obtained from the same individual was 
isolated directly after excision (RA 21.5A) or maintained in culture 
without serum for 2 hr (RA 21.5B) or for 6 hr (RA 21.5C). Profiles 
from tissue samples of two other individuals (data not shown) were 
remarkably similar to the ones shown here. IBD- A and IBD-CI are 
from mRNA samples prepared directly after surgery from two sepa- 
rate individuals. For the IBD-CII probe, the tissue sample was cultured 
in medium without serum for 2 hr before mRNA preparation. 



certain samples PUMP, TIMPs, particularly TIMP-1 and 
TIMP-3, and the adhesion molecule VCAM. Discernible levels 
of macrophage chemotactic protein 1 (MCP-1), MIF and 
R ANTES were also noted. IBD samples were in comparison, 
rather subdued although IL-1 converting enzyme (ICE), 
TIMP-1, and MIF were notable in all the three different IBD 
samples examined here. In IBD-A, one of three individual 
samples, ICE, VCAM, Groa, and MMP expression was more 
pronounced than in the others. 

We also made use of a peripheral blood cDNA library (3) 
to identify genes expressed by lymphocytes infiltrating the 
inflamed tissues from the circulating blood. With the 1046- 
element array of randomly selected cDNAs from this library, 
probes made from R A and IBD samples showed hybridizations 
to a large number of genes. Of these, many were common 
between the two disease tissues while others were differentially 
expressed (data not shown). A complete survey of these genes 
was beyond the scope of this study, but for this report we 
picked three genes that were up-regulated in the RA tissue 
relative to IBD. These cDNAs were sequenced and identified 
by comparison to the GenBank database. They are TIMP-1, 
apoferritin light chain, and manganese superoxide dismutase 
(MnSOD). Differential expression of MnSOD was only ob- 
served in samples of R A tissue explants maintained in growth 
medium without serum for anywhere between 2 to 16 hr. These 
results also indicate that the expression profile of genes can be 
altered when explants are transferred to culture conditions. 

DISCUSSION 

The speed, ease, and feasibility of simultaneously monitoring 
differential expression of hundreds of genes with the cDNA 
microarray based system (1-3) is demonstrated here in the 
analysis of a complex disease such as RA. Many different cell 
types in the RA tissue; macrophages, lymphocytes, plasma cells, 
neutrophils, synoviocytes, chondrocytes, etc. are known to con- 
tribute to the development of the disease with the expression of 
gene products known to be proinflammatory. They include the 
cytokines, chemokines, growth factors, MMPs, eicosanoids, and 
others (7, 11-14), and the design of the 96-element known gene 
microarray was based on this knowledge and depended on the 
availability of the genes. The technology was validated by con- 
firming earlier observations on the expression of TNF by the 
monocyte cell line MM6, and of Col-1 and Col-3 expression in the 
chondrosarcoma cells and articular chondrocytes (9, 12). In our 
time-dependent survey the chronological order of gene activities 
in and between gene families was compared and the results have 
provided unprecedented profiles of the cytokines (TNF, IL-1, 
IL-6, GCSF, and MIF), chemokines (MIP-lo, MIP-1/3, IL-8, and 
Gro-1), certain transcription factors, and the matrix metal- 
loproteinases (GelA, Strom-1, Col-1, Col-3, HME) in the mac- 
rophage cell line MM6 and in the SW1353 chondrosarcoma cells. 

Earlier reports of cytokine production in the diseased state had 
established a model in which TNF is a major participant in RA. 
Its expression reportedly preceded that of the other cytokines and 
effector molecules (4). Our results strongly support these results 
as demonstrated in the time course of the MM6 cells where TNF 
induction preceded that of ILla and followed by IL-6 and 
GCSF. These expression profiles demonstrate the utility of the 
microarrays in determining the hierarachy of signaling events. 

In the SW1353 chondrosarcoma cells, all the known MMPs and 
TIMPs were examined simultaneously. HME expression was 
discovered, which previously had been observed in only the 
stromal cells and alveolar macrophages of smoker's lungs and in 
placental tissue. Its presence in cells of the RA tissue is mean- 
ingful because its activity can cause significant destruction of 
elastin and basement membrane components (16, 17). Expression 
profiles of synovial fibroblasts and articular chondrocytes were 
remarkably similar and not too different from the SW1353 cells, 
indicating that the fibroblast and the chondrocyte can play equally 
aggressive roles in joint erosion. Prominent genes expressed were 
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the MMPs, but chemokines and cytokines were also produced by 
these cells. The effect of the anabolic growth factor TGF-0 was 
profoundly evident in demonstrating the down regulation of these 
catabolic activities. 

RA tissue samples undeniably reflected profiles similar to 
the cell types examined. Active genes observed were IL-3, IL-6, 
ICE, the MMPs including HME and TIMPs, chemokines IL-8, 
Groa, MIP, MIF, and RANTES, and the adhesion molecule 
VCAM. Of the growth factors, fibroblast growth factor /3 was 
observed most frequently. In comparison, the expression 
patterns in the other inflammatory state (i.e., IBD) were not 
as marked as in the R A samples, at least as obtained from the 
tissue samples selected for this study. 

As an alternative approach, the 1046 cDNA microarray of 
randomly selected genes from a lymphocyte library was used to 
identify genes expressed in RA tissue (3). Many genes on this 
array hybridized with probes made from both R A and IBD tissue 
samples. The results are not surprising because inflammatory 
tissue is abundantly supplied with cell types infiltrating from the 
circulating blood, made apparent also by the high levels of 
chemokine expression in RA tissue. Because of the magnitude of 
the effort required to identify all the hybridized genes, we have for 
this report chosen to describe only three differentially expressed 
genes mainly to verify this method of analysis. 

Of the large number of genes observed here, a fair number 
were already known as active participants in inflammatory dis- 
ease. These are TNF, IL-1, IL-6, IL-8, GCSF, RANTES, and 
VCAM. The novel participants not previously reported are 
HME, IL-3, ICE, and Groa. With our discovery of HME 
expression in RA, this gene becomes a target for drug interven- 
tion. ICE is a cysteine protease well known for its IL-lj3 process- 
ing activity (18), and recognized for its role in apoptotic cell death 
(19). Its expression in RA tissue is intriguing. IL-3 is recognized 
for its growth-promoting activity in hematopoietic cell lineages, is 
a product of activated T cells (20), and its expression in synovio- 
cytes and chondrocytes of RA tissue is a novel observation. 

Like IL-8, Groa, is a C-X-C subgroup chemokine and is a 
potent neutrophil and basophil chemoattractant. It down- 
regulates the expression of types I and III interstitial collagens 
(21, 22) and is seen here produced by the MM6 cells, in primary 
synoviocytes, and in R A tissue. With the presence of RANTES, 
MCP, and MIP-10, the C-C chemokines (23) migration and 
infiltration of monocytes, particularly T cells, into the tissue is 
also enhanced (5) and aid in the trafficking and recruitment of 
leukocytes into the RA tissue. Their activation, phagocytosis, 
degranulation, and respiratory bursts could be responsible for 
the induction of MnSOD in RA. MnSOD is also induced by 
TNF and IL-1 and serves a protective function against oxida- 
tive damage. The induction of the ferritin light chain encoding 
gene in this tissue may be for reasons similar to those for 
MnSOD. Ferritin is the major intracellular iron storage protein 
and it is responsive to intracellular oxidative stress and reactive 
oxygen intermediates generated during inflammation (24, 25). 
The active expression of TIMP-1 in R A tissue, as detected by 
the 1000-element array, is no surprise because our results have 
repeatedly shown TIMP-1 to be expressed in the constitutive 
and induced states of RA cells and tissues. 

The suitability of the cDNA microarray technology for 
profiling diseases and for identifying disease related genes is 
well documented here. This technology could provide new 



targets for drug development and disease therapies, and in 
doing so allow for improved treatment of chronic diseases that 
are challenging because of their complexity. 
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Pharmagene 
Raises More 
Capital for 
Research on 
Human 
Tissues 

By Sophia Fox 

Ik mharinageae, the Royston, 
flllC-based biopharmaceim- 
JL cal company specialising in 
the use of human btonvttcn&ls for 
drug discovery research, has raised a 
further £5 rruDton from s group of 
investors led by 3i and Abacus 
Nominees. The funding will enable 
the company to expand both its 
human biomatenals collection and 
its capabilities across a range of pro- 
prietary platform technologies. 

Gordon Baxter. Ph.D., 
Pharmagene s cofbunder and chief 
operating officer, claimed, Tjy the 
end of this year Pharmagene will 
have access to the largest collection 
of human RNAs and proteins any- 
where in the world, and a range of 
innovative, yet robust technologies 
SEE PHARMAGENE, P. • 
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Perkm-Elmer Acquires PerSeptive to Expand 
Its Capabilities in Gene-BasedDrug Discovery 



By John Sterling 

Peridn-Ebner 1 * (PE; Norwalk, 
CT) decision last month to 
acquire PerSeptive Bto- 
syjtems (Framtnghanx MA) via a 
$360 million stock swap was 
designed to strengthen PE in terms 
of broad capabilities in gene-based 
drug discovery. The company^ 
main goal is to develop new prod- 
ucts to improve the integration of 
genetic and protein research. 

"This merger will enhance our 
position as an effective provider of 
innovative, integrated platforms 
enabling our customers to be more 
efficient and cost-effective in bring- 
ing new pharmareiftic n lg to mar- 
keC says Tony L. White, PE* 
chairman, president and CEO. "The 
combination of our two co m p a nies 
should bolster our presence in the 
life sciences, (and it is our] belief 
that we must take bold action now 
to lead the emerging era of molecu- 
lar medicine with leading positions 
in both genetic and protein analy- 
sis." 

A driving force behind the 
merger is the vast amount of genet- 



FDA OKs Genzyme's Carticel 
Product for Damage to Knees 



— Periosteal flap 



Genzyme Ti«tm Rrpai 



Cell Processing 



Carticel. which was approved for the repair of clinically significant, symp- 
tomatic cartilaginous defects of the fernowl condyle (medial lateral or 
trochlear) caused by acute or repetitive trauma, employs a proprietary 
process to grrrw autolng/MB cartilage cells for implantation. 



f ■ lie FDA has improved a knee- 
I cartilage replacement product 
X made by Genryme Tissue 
Repair (Cambridge, MA), a track- 
ing-stock division of Genzyme 
Corp., for people with trauma- 
damaged knees. 

Carticer (autologous cultured 
chondrocytes) is the first product to 
be licensed under the FDA* pro- 
p.e 
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ic information about human dis- 
ease that is being accumulated by 
researchers and biotcch companies 
working in the area of genomics. It 
is becoming increasingly obvious 
that these data need to be comple- 
mented with technologies for 



studying proteins and protein net- 
works — a field known as pro- 
tcomics (sec GEN. ScpttmhtT 1. 
1997. p. ft 

PE officials, who claim that 
MALDI-TOF (Matrix Assisted 

SEE ACOUSmOH, P. 10 



Strategies for Target Validation 
Streamline Evaluation of Leads 



ByVkkJ Glaser 

cada Biosciences (Rich- 
mond, CA) last month 
its fust agree- 
ment with a major pharmaceutical 
eompany, signing a deal with EO 
Lilly (Indianapolis. IN) to use 
Acaciafc Genome Reporter Matrix 
(GRM) to select and < ' 



of Lilh/% lead awnpouncs. Acacia* 
yeast-based system for profiling 
drug activity is useful for evaluating 
the therapeutic potential of lead 
compounds, and H also has a role in 
the ideritification and validation of 
new drug targets. 

"Were using the ecosystem of a 
cell to allow us to deduce the mech- 
anism of action and target for any 
chemical." explains Bruce Cohen, 
president and CEO. "we screen for 
every target Hi a celt sinnthancous- 
ty... using transcription as a readout 



for how a cell is adapting to any 
perturbation,** he says. 

The GRM technology consists of 
two main databases: one is the 
genetic response profile, showing 
the effects of mutations in each 
individual yeast gene and compen- 
satory gene regulatory mecha- 
nisms; the other is the chemical 
response profile, which documents 
changes in gene expression in 
response to chemical compounds. 
Computational analysis and pattern 
matching between the genetic and 
chemical profiles yields informa- 
tion on the specificity, potency and 
side-effects risk of a drug lead 

Targeting Targets 

No longer is mapping and 
sequencing a gene—or the human 
genome— an end unto itself, but 
SEE TARGET, P. 18 



Sticky Ends 

Avlgen received two 
grants from the HIH I 
University of Cali- 
fornia for research 
on gene therapy for 
treatment of cancer & 
HIV infections. . .MUX. 
Pharmaceutical Servi- 
ce*, of Reaton, VA, 
launched the TSM Bug 
Finder, which is able 
to locate t retrieve 
client -specif led mi- 
croorganisms in real- 
time. . .Oensi* Sieor, 
Inc. will move its 
corporate staff from 
San Diego to Irvine , 
CA, by end of year... 



FDA accepted NDA from 
Sepracor for levalbu- 
terol HC1 inhalation 
solution. . .An 911. 7M 
mezzanine financing 
has been closed by 
Activated Cell Thera- 
py, which changed its 
name to Dendreon Cor- 
poration. . .Astra AB 
will build major re- 
search facility in 
Walt ham, MA, and is 
also relocating Astra 
Arena research facil- 
ity from Rochester to 
Boston area. . . Prolif- 
ic Ltd. team used a 
small peptide to in- 
hibit the E2F protein 
complex and induced 



a pop t os is in mammali- 
an tumor cells... Ver- 
tex Pharmaceuticals , 

Inc. and Alpha Thera- 
peutic Corp. ended an 
agreement to develop 
VX-366 for treatment 
of inherited hemoglo- 
bin disorders. . .Ravi - 
Cyte received Phase 1 
SBIR grant for up to 
$100,000 from NIH for 
development of proto- 
type of Ice NavlPlow 
technology for high- 
throughput screening 
. . .Covan.ce Inc. will 
invest $21 million in 
expansion and renova- 
tion of its facility 
in Indianapolis, IN. 
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merely a means in an end The criti- 
cal next step is to validate the gene 
and its protein product as a potential 
dru£ target. The Human Genome 
Project continues to produce a trea- 
sure chcsl or expressed sequence 
tags (EST*) and a tantalizing array of 
complete gene sequences. 

Companies are applying a variety 
of functional genomic strategies to 
link genes to specific diseases and to 
muhigenic phciwtypc s . Yet the ulti- 
mate challenge for pharmaceutical 
companies is to sift through all the 
sequence and differential gene 
expression data to identify the best 
targets for drug discovery. 

Spinning off technology devel- 
oped at the University of North 
Carolina (Chapel Hill), Cytogen 
Corp. (Princeton, NJ) formed its 
wholly owned subsidiary AxCetl 
Biosciences earlier this year. The 
young c o m p any is building a protein 
interaction database, cataloging all 
the interactions the modular domains 
of proteins can engage in with a 



range of ligands, in order to gain 
insight into protein function and to 
select the most critical interaction to 
target for drug dcvelopTncnt 

Ax CcllS clomng-of-ucand-targcte 
(COLT) technology employs "recog- 
nition units'* from the company's 
genetic diversity library (GDU to 
functional protein interact ions 
quanthatc their affinity. The 
company's inter-functional prot com- 
ic database (IFP-dbasc) elucidates 
protein interaction networks and 
sftjcojnMKtivity relationships based 
on tigand affinity with protein mod- 
ular domains. 



Defining Disease Pathways 

Signal PhariuaceutkaK Inc. 1 * 
(San Diego. CA) integrated drug tar- 
get and discovery effort is based on 
mapping gene-regulating pathways in 
cells and identifying small molecules 
that regulate the activation of those 
genes. In collaboration with academ- 
ic researchers, the company has iden- 
tified a large number of regulatory 
proteins in several mhogen-ectivated 
protein (MAP) kinase pathways 
(including the JNK, FRK and p38 



signaling pathways), which Signal is 
evaluating for the t reat m en t of 
autoimmune* inflammatory, cardio- 
vascular and neurologic diseases, and 
cancer. Other target identification 



programs focus on the NF-kB path- 
way, estrogen-related genes and ccn- 
traVpertphera) nervous system g^enes. 

Regulating cytokine production in 
immune and inflammatory disorders. 



and modifying bone metabolism to 
treat ostcopexosis are the focus of 
Signal's collaboration with Tanabe 
Setyakn (Osaka. Japan). Signal has 
partnered with Organon/Akzo 
Nobel (Netherlands) to identify 
estrogen-responsive genes as targets 
for treating neurodegeTieistive and 
psychiatric diseases, atherosclerosis 
and ischemia, and with Roche 
Bioscience (Wo AlU\ CA) to devel- 
op human peripheral nerve cdl lines 
for the discovery of t renm c ul* for 
pain and incontinence* 

Exrtfcb' (S. San Francisco. CA) 
strategy for target selection is to 
define disease pathways and identify 
regulatory molecules that activate or 
inhibit those biochemical/genetic 
pathways. Based on the finding that 
these pathways are conserved across 
species, the company is studying the 
model genetic systems of Drosophila 
and Caenorhabdltbt clegans. Using 
its Path Finder technology; Exefixis 
systematically introduces mutations 
into the genomes of these model 
organisms, looking for mutations 
that enhance or suppress the target 
disease-related gene. These novel 
genes then become the basis of drug 
screening assays. 

Cadus Pharmaceutical Corp. 
(TaiTytown. NY) is identifying sur- 
rogate ligandR to newly discovered 
orphan G-protein coupled trans- 
membrane receptors of unknown 
function to determine the suitability 
of the receptors as drug targets. 
Inserting the novel receptor m a 
yeast system yields a tigand that 
activates the recep to r. Access to a 
surrogate ligand allows the company 
to s creen for receptor antagonists in 
the yeast system. 

"The antagonist plus the surro- 
gate ligand gives you two probes — 
an on probe and an off probe — 
which allows you to look at func- 
tion." explains David Webb, Ph.D., 
vp of research and chief scientific 
officer. A surrogate ligand also pro- 
vides information on which G-pro- 
tein interacts with the orphan recep- 
tor and its associated signaling path- 
ways, further clarifying the role of 
the receptor as a potential drug tar- 
get. Cadus' collaboration with 
SmitbKJlne (Philadelphia) capital, 
izes on Cadus' ability to determine 
orphan receptor function, applying 
the technology to SmithKlineVi pro- 
prietary, newly discovered G-pro- 
tein receptors. 

Cadus' recombinant yeast system 
can also be used to screen cell and 
tissue extracts Tor natural ligands. 
and the conTnony is accelerating its 
internal drugsliscovcry efforts in the 
areas of cancer, mflsnunauon and 
allergy. A recent equity investment in 
Axiom Biotechnologies (San Diego, 
CA) gave Cadus a license to Axiom* 
high-throughput pharmacologic 
screening system for lead optimiza- 
tion and discovery. 

As its name implies. 
gene/Networks (Alameda. CA) 
focuses on identifying gene networks 
that contribute to mulhgenic pheno- 
types and complex disease process- 
es. The integration of mouse and 
human genetic studies forms the 
basis of the technology. The Genome 
Tagged Mice database in develop- 
ment will serve as a library of natur- 
al mouse genetic and phenotypic 
variation. Disease-related genes 
identified in mice are then evaluated 
in human family- and population- 
based studies to confirm their clini- 
cal relevance and linkages to patho- 
physiologic traits. 



Blocking Gene Expression 

Inactivating a gene known to be 
expressed in itssociation with a par- 
ticular disease is one approach to 
identifying appropriate therapeutic 
targets. The target validation and dis- 
covery program at Rtbozyme 
Pharmaceuticals. Inc. ( Boulder. 
C ( )) applies the company's riho^rnc 
ux-hnohyy to ichievc selective inhi- 
hiliim of gene expression in cell cul- 
ture and in animals. 

Correlation of the gene cxnrcs- 
%'nm inhibit ion with r*hcnotype can 
SEE TARGET, P. 3S 
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suggest die relative importance of 
the gene in disease pathology. The 
company* nuc lease-resistant 
ribozymes form the basis of a col- 
laboration with Sobering AG 
(Germany) for drug target validation 
and the devdoprnent of ribozyrne- 
based therapeutic agents, and with 
Chiron Corp. (Emeryville, CA) for 
target validatioa 

With several o nti sen se compounds 
now p rogr e ssi ng through clinical tri- 
als, the concept of using oligonu- 
cleotides to inhibit gene activity is 
not new. But rather than focusing on 
therapeutics development, Sequitnr, 
Inc. (Natick, MA) is creating anti- 
sense compounds for the purpose of 
determining gene function and vali- 
dating drug targets. Clients typically 
provide the one-year-old company 
with the sequence (or EST) of a 
potential gene target and, in return, 
Sequhur custom designs a series of 
three to six antisense compounds thst 
yield a tnrec-to-ten-foW inhibition of 
the target gene in cell culture. The 
company also provides oligofectins, 
a series of catiortk lipids, to deliver 
the oligonucleotides to a variety of 
cultured cells. 

"Differential expression informa- 
tion is just for correlation, it doesn't 
tell function or confirm what would 
be a good target,'* says Tod Woolf, 
PhJX director of technology devel- 
opment at Sequhur. Whereas, anti- 
sense compounds will inhibit a tar- 
get Sequitur offers both phospho- 
rothioate DNA antisense com- 
pounds, and its proprietary Next 
Generation chimeric oligonu- 
cleotides, which have a higher 
hybridization affinity, greater spccr- 
ficity and reduced toxicity, according 
to the company. 

Mining Pathogen Genomes 

Companies such as Human 
Geaomc Sciences (HGS; Rockville. 
MD). Incyte (Palo Alto. CA). 




AxCeB Biosciences scientists say their technology enables the rapid and 
simple Junctional identification of the two essential molecular components 
of protein interaction networks: specific recognition units that bind distinct 
modular protein domains are identified and isokned using a combination 
structiirid^funcnonaJ approach that uses bath peptide phase display Ctnedc 
Diversity Libraries (GDL) and bioinformatics, and dotting of Ligand 
Targets (COLT) technology utilizes recognition units osjuncttoruri probes to 
isolate famiiies of interactor i 



big 60% of the genes of this fungal 
pathogen. This genome will become 
part of the company* PathoSeq 
microbial database, Incyte recently 
introduced the ZooSeq animal gene 
sequence and expression database. 
The database will provide genomic 
information across various species 
commonly used in pre c li nic al drug 
testing, which may help to better 
define potential drug targets. 

Millennium Pharmaceuticals con- 
tinues to report success in identifying 
novel drug targets, having recently 
discovered a novel chemokine called 
neurotactin and a new class of MAD- 
related proteins that inhibit trans- 
forming growth factor beta (TGF-fl) 
signaling. The co mp any also 
received US. patent coverage for the 
tub genes, believed to play a role in 
obesity, and for the gene that encodes 
the protein mcbstatm, which appears 
to suppress metastasis in malignant 




Millennium Phanaacetttkab Inc. 
(Cambridge, MA) and Genome 
TherapeutJcB (Walthara, MA) are 
relying on high-speed DNA sequenc- 
ing, po si tional cloning and other 
strategies to identify specific micro- 
bial genomic sites that would be 
good targets for infectious disease 
therapeutics. 

HGS recently completed sequenc- 
ing of the bacteria] pathogen 
Streptococcus pneumoniae, which is 
the focus of an agreement with 
Hoffmann-La Roche (Basel, 
Switzerland). Roche will use the 
sequence data to develop new anti- 
infectives against S. pneumoniae. 
HGS and Roche have expanded their 
collaboration to include a nonexclu- 
sive license to access sequence infor- 
mation for the intestinal bacterium 
Enterococcus faecalis. 

Incyte Pharmaceuticals has com- 
pleted one-fold coverage of the 
Candida albicans genome, identify- 



HIGH SPECIFIC ACTIVITY 
MICROBIAL ALKALINE 
PHOSPHATASE 
from Biocatalysts 

Biocatalysts Limited, the British speciality enzyme 
company, has developed a comptetely new type of 
alkaline phosphatase with many advantages over the 
types most commonly used, 
ft is of microbial origin with a high specific activity 
(unfike that from E coB) and with higher temperature and 
storage stability compared to that from caff Intestine. 
This is the first of several new generation Diagnostic 
enzymes being developed by Biocatalysts Limited with 
greatly improved stability. 

• Non-animal source, no risk of BSE or animal 
virus contamination 

• Higher temperature stability than calf Intestine 

• Much higher specific activity than from E. coii 

• Very high storage stability even in the absence 
of glycerol 

for further details on altetine phosphatase and our other 
diagnostic enzymes contact us tfrec? at the address below or 
within North America contact our US Distributor fo&forhPettibone 
'phone: 630 350 11 16 or tax 630-350-1606 
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Smith, now a computer program- 
mer, is an expert in systems tniegnv- 
bon, Internet technologies rod die 
appbe at iooof industrial engineering 
principles to the drug discovery 
process. Before co-founding Pangea, 
be w&s the manager of software 
development «t Attorney* Briefcase, 
a legal research software com pony. 

By being "in the trenches" with 
customers and collaborators, 
Belknson and Smith sensed the 
frustration of pharmaceutical 
researchers whose ^compatible 
tools have impeded th ei r progress. 
According to BcUeoson, "Most of 
them are geared toward analyzing 
one mo lecu le at s time. It s tike emp- 
tying the ocean with an eye drop- 
per— an incompatible eye dropper at 
that A pharmaceutical company 
may have 30 different drug discov- 
ery teams with various approaches. 
The problem is to manage the 
process of experirnenting with a lot 
of different approaches, to automate 
while mainlining flexibility** 

GencWortd 2.1 enables "integra- 
tion of the entire target discovery and 
validation process,** BeUenson says. 
The commercial software package 
coordinates the entire process of 
sequence-data analysis and can be 
integrated with other pip grar ro and 
databases, according to Smith, who 
adds that it handles thousands of 
sequence results, organizes and auto- 
mates annotation and seamlessly 
interacts with growing genome data- 
bases. Simple forms and menus 
enable users to turn raw sequence 
data into crucial knowledge for drug 
discovery by applying algorithms to 
sequences, creating custom analysis 
strategies and producing useful 
reports, without the need for writing 
computer code. Gene World 2.1 runs 
on a variety of platforms and operat- 
ing systems. 

Pairing industrial relational data- 
base-management systems with a 
web-browser interface, Pangea* 
Operating System of Drug 
Discovery" is an open-coinputing 
framework that allows dienvserver 
and Java-enabled web-based tech- 
nologies to collect, organize and ana- 
lyze drug discovery mformation for 
pharmaceutical companies to simpli- 
fy and accelerate drug discovery. The 
technology unites automated 
genomics database analysis for drug 
target site selection, chemical infor- 
mation database analysis and large- 
scale combinatorial chemistry pro- 
ject management and high-through- 
put screening project management 
for drug lead efficacy analysis. 
Pangea officials maintain that these 
integrated elements provide a unified 
environment for chemists, biologists 
and others involved in the drug dis- 
covery process to work together with 
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commercial and public domain 
software. 

Pangeas Operating System of 
Drug Discovery can accommodate 
Sybase, Oracle or Informix relation- 
al database-management systems 
and any version of UNIX. It absorbs 
new data formats, databases, algo- 
rithms and analysis paradigms into 
the automated workflow without 
software nradificauons. Netscape 
Navigator*" provides a friendly user 
interface from PC, Macintosh, and 
UNIX workstations. 

In the near term, Pangea plans to 
complete its bioinformatics core 
with two more programs. Gene 
foundry, a sample tracking and 
workflow sequence package for 
DNA sequence and fragment infor- 
matioo, will also offer interaction 
with robots, reagent tracking and 
traibteshooting. Gene Thesaurus, 
the other package is a "warehouse 
of bioinformatics data,** 
Bellenson. 
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GTAC Chairman, Professor 
Norman C. Nevin, said 1996 saw 
"four important developments'*: an 
increase in enquiries and submis- 
sions made to GTAC; an increase in 
the complexity of submitted proto- 
cols; a continuing shift from gene 
therapy for single-gene disorders 
toward strategics aimed at tumour 
destruction in cancer, and a growth 
in international sponsor sh ip of UK. 
gem therapy trials. 

Since 1993. GTAC and its prede- 
cessor, the Clothier Committee, have 
approved 18 UK. gene therapy clini- 
cal trials ( 1 3 of winch have been car- 
ried out), which are listed in the 
report The disease areas targeted by 
these trials include severe combined 
nnmunodeticiency (1 trial), cystic 
fibrosis (6), metastatic nieianotna (2\ 
ryrnphoma (2), neuroblastoma (\\ 
breast cancer (I). Hurler's syndrome 
f I V. cervical cancer ( 1 J, glioblastoma 



breast cancer, breast cancer with over 
metastases, gUctiastorna, malignant 
ascites due to gastrob^estinal cancer 
and ovarian cancer. 

Copies of the GTAC thrid annual 
report are available from the GTAC 
Secretariat, VVUlington House, 133- 
155 Waterloo Road, London SE1 
8UG.UK. 

Coated Lenses Prevent PCO 

Scientists in the UK. say it may be 
poss ib le to prevent posterior capsul e 
opacification (PCO), a corrarion 
complication following cataract 
surgery, by using the implanted pory- 
rnetlrylmethacrylate (PMMA) 
intraocular lens as a drug delivery 
system. PCO occurs in 30-50% of 
cataract surgery patients as a result of 
stimulated cell growth within the 
remaining capsula r bag. The condi- 
tion causes a decline in visual acuity 
and r e q u ires expensive laser treat- 
ment, thus negating the routine use of 
cataract surgery m underdeveloped 
ccajntries. explains G. Duncan, at the 
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Exploring the Metabolic and Genetic Control of 
Gene Expression on a Genomic Scale 

Joseph L DeRisi, Vishwanath R: lyerv Patrick 0>Brown* : 

DNA microarrays containing virtually every gene of Saccharomyces cerev'tsiae were used 
to carry out a comprehensive investigation of the temporal program of gene expression 
accompanying the metabolic shift from fermentation to respiration. The expression 
profiles observed for genes with known metabolic functions pointed to features of the 
metabolic reprogramming that occur during the diauxic'shift, and the expression patterns 
of many previously uncharacterized genes provided clues to their possible functions. The 
same DNA microarrays were also used to identify genes whose expression was affected 
by deletion of the transcriptional cb-repressorJUP7 or overexpression of the transcrip- 
tional activator- YAP1. These results demonstrate the feasibility and utility of this ap- 
proach to genomewide exploration of gene expression patterns. 5 



The complete sequences of nearly a dozen 
microbial genomes are known, and in the 
next several years we expect to know the 
complete genome sequences of several 
metazoans, including the human genome; 
Defining the role of each gene in these 
genomes will be a formidable task, and un- 
derstanding how the genome functions as a 
whole in the complex , natural history of a 
living organism presents an even ^greater 
challenge. 

Knowing when and where a gene is 
expressed often provides a strong clue as to 
its biological role. Conversely, the pattern 
of genes expressed in a cell can provide 
detailed information about its state. Al- 
though regulation of protein abundance in 
a cell is by no means accomplished solely 
by regulation of mRNA, virtually all dif- 
ferences in cell type or state are correlated 
with changes in the mRNA levels of many 
genes. This is fortuitous because the only 
specific reagent required to measure the 
abundance of the mRNA for a specific 
gene is a cDNA sequence. DNA microar- 
rays, consisting of thousands of individual 
gene sequences printed in a high-density 
array on a glass microscope slide (J, 2), 
provide a practical and economical tool 
for studying gene expression on a very 
large scale (3-6). 

Saccharomyces cerevisiae is an especially 

Department of Bwchemistry, Stanford University School 
of Medicine. Howard Hughes Medical Institute, Stanford. 
CA 94305-5428, USA. 

* To whom correspondence should be addressed. E-mail: 
pbrown®cmgm. stanford.edu 



favorable organism in which to conduct a 
systematic investigation of gene expression. 
The genes are easy to recognize in the ge- 
nome sequence, cis regulatory elements are 
generally compact and close to the tran- 
scription units/ much is already known 
about its genetic regulatory mechanisms, 
and a powerful set of tools is available for its 
analysis. * .*"'** 

A recurring cycle in the natural history 
of yeast involves a shift from, anaerobic 
(fermentation) to aerobic (respiration) me- 
tabolism. Inoculation of yeast into a medi- 
um rich in sugar is followed by rapid; growth 
fueled by fermentation, with the production 
of ethanol. When the 'fermentable sugar is 
exhausted, the yeast cells turn to ethanol as 
a carbon source for aerobic growth. This 
switch from anaerobic growth to aerobic 
respiration upon depletion of glucose, re- 
ferred to as the diauxic shift, is correlated 
with widespread changes in the expression 
of genes involved in fundamental cellular 
processes such as carbon metabolism, pro- 
tein synthesis, and carbohydrate- storage 
(7). We used DNA microarrays to charac- 
terize the changes in gene expression that 
take place during this process for nearly the 
entire genome, and to investigate the ge- 
netic circuitry that regulates and executes 
this program. 

Yeast open reading frames (ORFs) were 
amplified by the polymerase chain reaction 
(PCR), with a commercially available set of 
primer pairs (8). DNA microarrays, con- 
taining approximately 6400 distinct DNA 
sequences, were printed onto glass slides by 



' using £ simple robotic printing device: (9). 
Cells from an exponentially growing 'culture 

"of yeast were inoculated 'into fresh medium 
and grown at 30°G for .2 1 ! hours. - After an 
initial 9 hours of growth, samples were har- 
vested at seven successive 2-hour intervals, 
and mRNA was isolated (10). Fluorescently 
labeled cDN A was prepared by. reverse tran- 
scription .- in the presence of 1 Cy3 ( green)- 
or Gy5(red)4abeled deoxyuridine triphos- 
phate (dUTP) (11) and then hybridized to 
the microarrays ( 12). To maximize' the, re-. . 
liability -with which changes in expression 
levels could be discerned, we labeled cDN A 
prepared from, cells at each successive time 
point with Gy5, then mixed it with a Gy3- 
labeled "reference" cDNA sample prepared 
from cells harvested at the first interval 
after inoculation. ■ In this experimental de- 
sign, the relative fluorescence intensity 
measured for the Cy3 and Cy5 fluors at 
each array element provides a reliable mea- 
sure of; the relative abundance of the corre- 
sponding mRNA in the two cell popula- 
tions (Fig. 1). Data from the series of seven 
samples (Fig. 2), consisting of more than 
43,000 expression-ratio : measurements, 
were organized into a database to facilitate 
efficient exploration and analysis of the 
results. This database is publicly available 
on the Internet (13). 

During exponential growth in glucose-: 
rich medium, the .global pattern of gene - 
expression was remarkably stable. Indeed, 
when gene expression patterns between the 
first two cell samples (harvested at a 2-hour' 
interval) were compared, mRNA levels dif- 
fered by a factor of 2 or more for only 19' 
genes (0.3%), and the largest of these dif- 
ferences was only 2.7-fold (14). However, as . 
glucose was progressively depleted from the 
growth media . during the course of the ex- 
periment, a marked change was seen in the 
global pattern of gene expression. mRNA 
levels for approximately 710 genes were 
induced by a factor of at least 2, and the . 
mRNA levels for approximately 1030 genes 
declined by a factor of at least 2. Messenger 
RNA levels for 183 genes increased by a 
factor of at least 4, and mRNA levels for 
203 genes diminished by a factor of at least 
4- About half of these differentially ex- 
pressed genes have no currently recognized 
function and are not yet named. Indeed, 
more than 400 of the differentially ex- 
pressed genes have no apparent homology 
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to any gene whose function ^ is known (15). 
The' responses., of these previously Vurichar- 
acterized genes to the diauxic, shift therefore 
provides the first small clue to their possible 
roles. ^\ ! :"' ,tt '/ 

v The global vie w of . changes in expres- 
sion of genes with known . functions pro- 
vides a vivid picture of the way in which 
the cell adapts to Va -changing ehvirpn- 
ment. Figure 3 shows a portion of the yeast 
metabolic pathways involved : in 'carbon 
aria* :. energy metabbhsmZ/Mapping the 
changes .we observed nv the ^mRN As . en- 
coding each enzyme onto' this framework 
allowed us to infer the redirection in. the 
flow of metabolites, through this system. 
We observediarge inductions of the genes 
coding for the enzymes aldehyde dehydro- 
genase (ALD2) and acetyl-coenzyme 
A(CoA) synthase - (ACS J ), ■> which func- 
tion together to convert the products of 
alcohol dehydrogenase into acetyl-CoA, 
which in tum is used to fuel the tricarbox- 
ylic acid (TCA) cycle and rheglyoxylate 
cycle.. The concomitant shutdown of tran- 
scription of the genes encoding, pyruvate 
decarboxylase and induction of pyruvate 
carboxylase rechannels .pyruvate away 
from acetaldehyde, and instead to oxalac- 
etate, where it can serve, -to supply the 
TCA cycle and gluconeogenesis. Induc- 
tion of the pivotal genes PCKJ, encoding 
phosphoenol pyruvate car boxy kinase, and 
FBPJ, encoding fructose 1,6-biphos- 
phatase, switches the directions of two key 
irreversible steps in glycolysis, reversing 
the flow of metabolites along the* revers- 
ible steps of the glycolytic pathway toward 
the essential; biosynthetic precursor, glu- 
coses-phosphate. Induction of the genes 
coding for the trehalose synthase and gly- 
cogen synthase complexes promotes chan- 
neling of glucose-6-phosphate into these 
carbohydrate storage pathways. 

Just as the changes in expression of 
genes encoding pivotal enzymes can pro- 
vide insight into metabolic reprogram- 
ming, the behavior of large groups of func- 
tionally related genes can provide a broad 
view of the systematic way in which the 
yeast cell adapts to a changing environ- 
ment (Fig. 4). Several classes of genes, 
such as cytochrome c-related genes and 
those involved in the TCA/glyoxylate cy- 
cle and carbohydrate storage, were coordi- 
nate^ induced by glucose exhaustion. In 
contrast, genes devoted to protein synthe- 
sis, including ribosomal proteins, tRNA 
synthetases, and translation, elongation, 
and initiation factors, exhibited a coordi- 
nated decrease in expression. More than 
95% of ribosomal genes showed at least 
twofold decreases in expression during the 
diauxic shift (Fig. 4) (J 3). A noteworthy 
and illuminating exception was that the 



■genes encoding mitochondrial ribosomal 
genes were generally ^induced rather than 

.-.repressed after glucose limitation i high- 
lighting the requirement for mitochondrial 

^ biogenesis (13). As more is learned' about 
the .functions, of,, every'; .gene in the yeast 

^genome,- the. ability to gain insight into a 
cell's response; to a changing environment 
through its global gerie.expression patterns 
will, become increasingly powerful, r 

■'Several distinct temporal patterns of ex- 
pression could be recognized, and sets of 
genes could; be grouped on the basis of the 
similarities in theirexpression patterns. The 

/ characterized members of each of these 
groups also iriared^ important similarities in 
thedr^ functions. - Moreover, * in . most ' cases, 
common regulatory mechanisms could be 
inferred for sets of genes with similar expres- 
sion profiles. •pFot^ example, seven genes 
"showed a late induction profile, with mRNA 
levels increasing by more than ninefold at 



the last timepoint but less than threefold at 
the preceding timepoint <Fig. : 5B): All of 
these genes were, known^td be glucose-re- 
pressed, and five of the seven were previously 
noted do share a cornmon upstream activat- 
ing; sequence;{JJAS), the carbon source re- 
spohse element /(CSRE) (16^20). A . search 
in the promoter' regions of the remaining two 
genes, fACRJ and /DP2, f revealed" that 
ACRi, a "gene essential for ACS J activity, 
also possessed a consensus CSRE motif, but 
interestingly; IDP2 did riot. A search of the 

_ entire yeast genome sequence for the con- 
sensus -CSRE motif revealed, only four addi- 
tional candidate genes, none of which 
showed^ similar induction. ; ^ * • ■ 

Examples from ; add itiohal groups of 

j : genes that shared expression profiles are 
illustrated in : Fig; 5, C : through F. the 
sequences upstream' of the named genes in 
Fig., 5Q\ all contain stress, response ele- 
ments (STRE), rand, with the exception 




Fig. 1. Yeast genome microarray. The actual size of. the microarray is 18 mm by 18 mm. The 
microarray was printed as described (9). This image was obtained with the same fluorescent 
scanning confocal microscope used to collect all the data we report {49). A fluorescently labeled 
cDNA probe was prepared from mRNA isolated from cells harvested shortly after inoculation {culture 
density of <5 x 10 8 cells/ml and media glucose level of 19 g/liter) by reverse transcription in the 
presence of Cy3-dUTP. Similarly, a second probe was prepared from mRNA isolated from cells taken 
from the same culture 9.5 hours later (culture density of -2 x 10 8 cells/ml, with a glucose level of 
<0.2 g/liter) by reverse transcription in the presence of Cy5-dUTP. In this image, hybridization of the 
Cy3-dirrP-rlabeled cDNA (that is, mRNA expression at the initial timepoint) is represented as a green 
signal, and hybridization of Cy5-dUTP-labeled cDNA (that is, mRNA expression at 9.5 hours) is 
represented as a red signal. Thus, genes induced or repressed after the diauxic shift appear in this 
image as red and green spots, respectively. Genes expressed at roughly equal levels before and after 
the diauxic shift appear in this image as yellow spots. 
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of HSP42, have previously been shown to 
be controlled at least in part by these 
elements (21-24). Inspection of the se- 
quences upstream of HSP42 and the two 
uncharacterized genes shown in Fig. 5C, 
YKL026c, a hypothetical protein with 
similarity to glutathione peroxidase, and 
YGR043c, a putative transaldolase, re- 
vealed that each of these genes also pos- 
sess repeated upstream copies of the stress- 
responsive CCCCT motif. Of the 13 ad- 
ditional genes in the yeast genome that 
shared this expression profile [including 
HSP30, ALD2, OM45, and 10 uncharac- 
terized ORFs (25)], nine contained one or 
more recognizable STRE sites in their up- 
stream regions. 

The heterotrimeric transcriptional acti- 
vator complex HAP2,3,4 has been shown 
to be responsible for induction of several 
genes important for respiration (26-28). 
This complex binds a degenerate consensus 
sequence known as the CCAAT box (26). 
Computer analysis, using the consensus se- 
quence TNRYTGGB (29), has suggested 
that a large number of genes involved in 
respiration may be specific targets of 
HAP2,3,4 (30). Indeed, a putative 
HAP2,3 t 4 binding site could be found in 
the sequences upstream of each of the seven 
cytochrome c-related genes that showed 
the greatest magnitude of induction (Fig. 
5D). Of 12 additional cytochrome c-related 
genes that were induced, HAP2,3,4 binding 
sites were present in all but one. Signifi- 
cantly, we found that transcription of 
HAP4 itself was induced nearly ninefold 
concomitant with the diauxic shift. 

Control of ribosomal protein biogenesis 
is mainly exerted at the transcriptional 
level, through the presence of a common 
upstream-activating element (UAS ) 
that is recognized by the Rapl DNA-bind- 
ing protein (31 , 32). The expression pro- 
files of seven ribosomal proteins are shown 
in Fig. 5F. A search of the sequences 
upstream of all seven genes revealed con- 
sensus Rapl -binding motifs (33). It has 
been suggested that declining Rapl levels 
in the cell during starvation may be re- 
sponsible for the decline in ribosomal pro- 
tein gene expression (34). Indeed, we ob- 
served that the abundance of RAP J 
mRNA diminished by 4.4-fold, at about 
the time of glucose exhaustion. 

Of the 149 genes that encode known or 
putative transcription factors, only two, 
HAP4 and SlP4 t were induced by a factor of 
more than threefold at the diauxic shift. 
S1P4 encodes a DNA-binding transcrip- 
tional activator that has been shown to 
interact with Snfl , the "master regulator" of 
glucose repression (35). The eightfold in- 
duction of S1P4 upon depletion of glucose 
strongly suggests a role in the induction of 



downstream genes at the diauxic shift. 

Although most of the transcriptional 
responses that we observed were not pre- 
viously known, the responses of many 
genes during the diauxic shift have been 
described. Comparison of the results we 
obtained by DNA microarray hybridiza- 
tion with previously reported results there- 
fore provided a strong test of the sensitiv- 
ity and accuracy of this approach. The 
expression patterns we observed for previ- 
ously characterized genes showed almost 
perfect concordance with previously pub- 
lished results (36). Moreover, the differ- 
ential expression measurements obtained 
by DNA microarray hybridization were re- 
producible in duplicate experiments. For 
example, the remarkable changes in gene 
expression between cells harvested imme- 
diately after inoculation and immediately 
after the diauxic shift (the first and sixth 
intervals in this time series) were mea- 
sured in duplicate, independent DNA mi- 
croarray hybridizations. The correlation 
coefficient for two complete sets of expres- 
sion ratio measurements was 0.87, and for 
more than 95% of the genes, the expres- 



sion ratios measured in these duplicate 
experiments differed by less than a factor 
of 2. However, in a few cases, there were 
discrepancies between our results and pre- 
vious results, pointing to technical limita- 
tions that will need to be addressed as 
DNA microarray technology advances 
(37, 38). Despite the noted exceptions, 
the high concordance between the results 
we obtained in these experiments and 
those of previous studies provides confi- 
dence in the reliability and thoroughness 
of the survey. 

The changes in gene expression during 
this diauxic shift are complex and involve 
integration of many kinds of information 
about the nutritional and metabolic state 
of the cell. The large number of genes 
whose expression is altered and the diver- 
sity of temporal expression profiles ob- 
served in this experiment highlight the 
challenge of understanding the underlying 
regulatory mechanisms. One approach to 
defining the contributions of individual 
regulatory genes to a complex program of 
this kind is to use DNA microarrays to 
identify genes whose expression is affected 



Fig, 2. The section of the ar- 
ray indicated by the gray box 
in Rg. 1 is shown for each of 
the experiments described 
here. Representative genes 
are labeled. In each of the ar- 
rays used to analyze gene 
expression during the diauxic 
shift, red spots represent 
genes that were induced rel- 
ative to the initial timepoint, 
and green spots represent 
genes that were repressed 
relative to the initial timepoint. 
In the arrays used to analyze 
the effects of the fup7A mu- 
tation and YAP1 overexpres- 
sion, red spots represent 
genes whose expression was 
increased and green spots 
represent genes whose ex- 
pression was decreased by 
the genetic modification. Note 
that distinct sets of genes are 
induced and repressed in the 
different experiments. The 
complete images of each of 
these arrays can be viewed on 
the Internet (73). Cell density 
as measured by optical densi- 
ty (OD) at 600 nm was used to 
measure the growth of the 
culture. 
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by mutations in each putative regulatory 
gene. As a test of this strategy, we analyzed 
the genomewide changes in gene expression 
that result from deletion of tie TUPl gene. 
Transcriptional repression of many genes by 
glucose requires the DNA-binding repressor 



Migl and is mediated by recruiting the tran- 
scriptional co-repressors Tupl and Cyc8/ 
Ssn6 (39). Tupl has also been implicated in 
repression of oxygen-regulated, mating-type- 
specific, and DNA-damage-inducible genes 
(40). 



Debranching 
6.1 




9 5.8 
Glucose 

2.2 



Pentose Phosphate 
Pathway, RNA, DMA, 
Proteins 




Fig. 3. Metabolic reprog ramming inferred from global analysis of changes in gene expression. Only key 
metabolic intermediates are identified. The yeast genes encoding the enzymes that catalyze each step 
in this metabolic circuit are identified by name in the boxes. The genes encoding succinyl-CoA synthase 
and glycogen-debranching enzyme have not been explicitly identified, but the ORFs YGR244 and 
YPR184 show significant homology to known succinyl-CoA synthase and glycogen-debranching en- 
zymes, respectively, and are therefore included in the corresponding steps in this figure. Red boxes with 
white lettering identify genes whose expression increases in the diauxic shift. Green boxes with dark 
green lettering identify genes whose expression diminishes in the diauxic shift. The magnitude of 
induction or repression is indicated for these genes. For multimeric enzyme complexes, such as 
succinate dehydrogenase, the indicated fold-induction represents an unweighted average of all the 
genes listed in the box. Black and white boxes indicate no significant differential expression (less than 
twofold). The direction of the arrows connecting reversible enzymatic steps indicate the direction of the 
flow of metabolic intermediates, inferred from the gene expression pattern, after the diauxic shift. Arrows 
representing steps catalyzed by genes whose expression was strongly induced are highlighted in red. 
The broad gray arrows represent major increases in the flow of metabolites after the diauxic shift, 
inferred from the indicated changes in gene expression. 



Wild-type yeast cells and cells bearing 
a deletion of the TUP J gene (tupl A) were 
grown in parallel cultures in rich medium 
containing glucose as the carbon source. 
Messenger RNA was isolated from expo- 
nentially growing cells from the two pop- 
ulations and used to prepare cDNA la- 
beled with Cy3 (green) and Cy5 (red), 
respectively (J I). The labeled probes were 
mixed and simultaneously hybridized to 
the microarray. Red spots on the microar- 
ray therefore represented genes whose 
transcription was induced in the tup J A 
strain, and thus presumably repressed by 
Tupl (41). A representative section of the 
microarray (Fig. 2, bottom middle panel) 
illustrates that the genes whose expression 
was affected by the tupl A mutation, were, 
in general, distinct from those induced 
upon glucose exhaustion (complete images 
of all the arrays shown in Fig. 2 are avail- 
able on the Internet (13)]. Nevertheless, 
34 (10%) of the genes that were induced 
by a factor of at least 2 after the diauxic 
shift were similarly induced by deletion of 
TUP J , suggesting that these genes may be 
subject to TUP J -mediated repression by 
glucose. For example, SUC2, the gene en- 
coding invertase, and all five hexose trans- 
porter genes that were induced during the 
course of the diauxic shift were similarly 
induced, in duplicate experiments, by the 
deletion of TUPL 

The set of genes affected by Tupl in this 
experiment also included ct-glucosidases, 
the mating- type-specific genes MFA1 and 
MFA2, and the DNA damage-inducible 
RNR2 and KNR4, as well as genes involved 
in flocculation and many genes of unknown 
function. The hybridization signal corre- 
sponding to expression of TUP] itself was 
also severely reduced because of the (in- 
complete) deletion of the transcription unit 
in the tup J A strain, providing a positive 
control in the experiment (42). 

Many of the transcriptional targets of 
Tupl fell into sets of genes with related 
biochemical functions. For instance, al- 
though only about 3% of all yeast genes 
appeared to be TUP J -repressed by a factor 
of more than 2 in duplicate experiments 
under these conditions, 6 of the 13 genes 
that have been implicated in flocculation 
(15) showed a reproducible increase in 
expression of at least twofold when TUPl 
was deleted. Another group of related 
genes that appeared to be subject to TUP I 
repression encodes the serine-rich cell 
wall mannoproteins, such as Tipl and 
Tirl/Srpl which are induced by cold 
shock and other stresses (43), and similar, 
serine-poor proteins, the seripauperins 
(44). Messenger RNA levels for 23 of the 
26 genes in this group were reproducibly 
elevated by at least 2.5-fold in the tupl A 
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strain, and 18 of these genes were induced 
by more than sevenfold when TUP I was 
deleted. In contrast, none of 83 genes that 
could be classified as putative regulators of 
the cell division cycle were induced more 
than twofold by deletion of TUP I . Thus, 
despite the diversity of the regulatory sys- 
tems that employ Tupl, most of the genes 
that it regulates under these conditions 
fall into a limited number of distinct func- 
tional classes. 

Because the microarray allows us to 
monitor expression of nearly every gene in 
yeast, we can, in principle, use this ap- 
proach to identify all the transcriptional 
targets of a regulatory protein like Tupl. It 
is important to note, however, that in any 
single experiment of this kind we can only 
recognize those target genes that are nor- 
mally repressed (or induced) under the 
conditions of the experiment. For in- 
stance, the experiment described here an- 
alyzed a MAT a strain in which MFAJ 
and MFA2, the genes encoding the a- 
factor mating pheromone precursor, are 
normally repressed. In the isogenic tup J A 
strain, these genes were inappropriately 
expressed, reflecting the role that Tupl 
plays in their repression. Had we instead 
carried out this experiment with a MATA 
strain (in which expression of MFAJ and 
MFA2 is not repressed), it would not have 
been possible to conclude anything re- 
garding the role of Tupl in the repression 
of these genes. Conversely, we cannot dis- 
tinguish indirect effects of the chronic 
absence of Tupl in the mutant strain from 
effects directly attributable to its partici- 
pation in repressing the transcription of a 
gene. 

Another simple route to modulating the 
activity of a regulatory factor is to overex- 
press the gene that encodes it. YAP I en- 
codes a DNA-binding transcription factor 
belonging to the b-zip class of DNA-bind- 
ing proteins. Overexpression of YAP] in 
yeast confers increased resistance to hydro- 
gen peroxide, o-phenanthroline, heavy 
metals, and osmotic stress (45). We ana- 
lyzed differential gene expression between a 
wild-type strain bearing a control plasmid 
and a strain with a plasmid expressing YAPJ 
under the control of the strong GAL1-10 
promoter, both grown in galactose (that is, 
a condition that induces YAPJ overexpres- 
sion). Complementary DNA from the con- 
trol and YAPJ overexpressing strains, la- 
beled with Cy3 and Cy5, respectively, was 
prepared from mRNA isolated from the two 
strains and hybridized to the microarray. 
Thus, red spots on the array represent genes 
that were induced in the strain overexpress- 
ing YAPJ. 

Of the 17 genes whose mRNA levels 
increased by more than threefold when 



YAPJ was overexpressed in this way, five 
bear homology to aryl-alcohol oxidoreduc- 
tases (Fig. 2 and Table 1). An additional 
four of the genes in this set also belong to 
the general class of dehydrogenases/oxi- 
doreductases. Very little is known about 
the role of aryl-alcohol oxidoreductases in 
S. cerevisiae, but these enzymes have been 
isolated from ligninolytic fungi, in which 
they participate in coupled redox reac- 
tions, oxidizing aromatic, and aliphatic 
unsaturated alcohols to aldehydes with the 
production of hydrogen peroxide (46, 47). 
The fact that a remarkable fraction of the 
targets identified in this experiment be- 
long to the same small, functional group of 
oxidoreductases suggests that these genes 

Fig. 4. Coordinated reg- 
ulation of functionally re- 
lated genes. The curves 
represent the average in- 
duction or repression ra- 
tios for all the genes in 
each indicated group. 
The total number of 
genes in each group was 
as follows: ribosornaJ 
proteins, 112; translation 
elongation and initiation 

factors, 25; tRNA synthetases (excluding mitochondial synthetases), 17; glycogen and trehalose syn- 
thesis and degradation, 15; cytochrome c oxidase and reductase proteins, 19; and TCA- and glyoxy- 
late-cycle enzymes, 24. 

Table 1 . Genes induced by YAP1 overexpression. This list includes all the genes for which mRNA levels 
increased by more than twofold upon YAP1 overexpression in both of two duplicate experiments, and 
for which the average increase in mRNA level in the two experiments was greater than threefoJd(50). 
Positions of the canonical Yap1 binding sites upstream of the start codon, when present, and the 
average fold-increase in mRNA levels measured in the two experiments are indicated. 



might play an important protective role 
during oxidative stress. Transcription of a 
small number of genes was reduced in the 
strain overexpressing Yapl. Interestingly, 
many of these genes encode sugar per- 
meases or enzymes involved in inositol 
metabolism. 

We searched for Yapl -binding sites 
(TTACTAA or TGACTAA) in the se- 
quences upstream of the target genes we 
identified (48). About two-thirds of the 
genes that were induced by more than 
threefold upon Yapl overexpression had 
one or more binding sites within 600 bases 
upstream of the start codon (Table 1), sug- 
gesting that they are directly regulated by 
Yapl. The absence of canonical Yapl-bind- 



-o~ GtycogerVTretiatose 
-o— Cytocftrome-c 
-»-TCA / Glyoxalate cycle 



- RibosornaJ proteins 
-Translation donga tion/Jnlt 
-tRNA synthetase 




ORF 



Distance of Yapl 
site from ATG 



Gene 



Description 



Fold- 
increase 



YNL331C 
YKL071W 
YML007W 

YFL056C 

YU060C 
YOL165C 

YCR107W 
YML116W 

YBR008C 

YCLX08C 
YJR155W 
YPU71C 

YLR460C 

YKR076W 

YHR179W 

YML131W 

YOL126C 



Putative aryl-alcohol reductase 12.9 

1 62-222 (5 sites) Similarity to bacterial csgA protein 1 0.4 

YAP1 Transcriptional activator involved in 9.8 
oxidative stress response 

223, 242 Homology to aryl-alcohol 9.0 

dehydrogenases 

98 Putative glutathione transferase 7,4 

266 Putative aryl-alcohol dehydrogenase 7.0 

(NADP+) 

Putative aryl-alcohol reductase 6.5 

409 ATR1 Aminotriazole and 4-nitroquinoline 6.5 

resistance protein 

142, 167, 364 Homology to benomyl/methotrexate 6.1 

resistance protein 

Hypothetical protein 6.1 

Putative aryt-aJcohol dehydrogenase 6.0 

148,212 OYE3 NAPDH dehydrogenase (old yellow 5.8 

enzyme), isoform 3 

1 67, 31 7 Homology to hypothetical proteins 4 7 

YCR102C and YNL134c 

17 8 Homology to hypothetical protein 4 5 

YMR251W 

327 OYE2 NAD(P)H oxidoreductase (old yellow 4.1 

enzyme), isoform 1 

507 Similarity to A. thafiana zeta-crystalfin 3.7 

homolog 

MDH2 Malate dehydrogenase 3.3 
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ing sites upstream of the others may reflect 
an ability of Yapl to bind sites that differ 
from the canonical binding sites, perhaps in 
cooperation with other factors, or less like- 
ly, may represent an indirect effect of Yapl 
overexpression, mediated by one or more 
intermediary factors. Yapl sites were found 
only four times in the corresponding region 
of an arbitrary set of 30 genes that were not 
differentially regulated by Yapl. 

Use of a DNA microarray to character- 
ize the transcriptional consequences of 
mutations affecting the activity of regula- 
tory molecules provides a simple and pow- 
erful approach to dissection and character- 
ization of regulatory pathways and net- 



works. This strategy also has an important 
practical application in drug screening. 
Mutations in specific genes encoding can- 
didate drug targets can serve as surrogates 
for the ideal chemical inhibitor or modu- 
lator of their activity. DNA microarrays 
can be used to define the resulting signa- 
ture pattern of alterations in gene expres- 
sion, and then subsequently used in an 
assay to screen for compounds that repro- 
duce the desired signature pattern. 

DNA microarrays provide a simple and 
economical way to explore gene expres- 
sion patterns on a genomic scale. The 
hurdles to extending this approach to any 
other organism are minor. The equipment 
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Fig. 5. Distinct temporal patterns of induction or repression help to group genes that share regulatory 
properties. (A) Temporal profile of the cell density, as measured by OD at 600 nm and glucose 
concentration in the media. (B) Seven genes exhibited a strong induction (greater than ninefold) only at 
the last timepoint (20.5 hours). With the exception of IDP2, each of these genes has a CSRE UAS. There 
were no additional genes observed to match this profile. (C) Seven members of a class of genes marked 
by early induction with a peak in mRNA levels at 1 8.5 hours. Each of these genes contain STRE motif 
repeats in their upstream promoter regions. (D) Cytochrome c oxidase and ubiquinol cytochrome c 
reductase genes. Marked by an induction coincident with the diauxic shift, each of these genes contains 
a consensus binding motif for the HAP2.3.4 protein complex. At least 17 genes shared a similar 
expression profile. (E) SAM1, GPP/, and several genes of unknown function are repressed before the 
diauxic shift, and continue to be repressed upon entry into stationary phase, (F) Ribosomai protein 
genes comprise a large class of genes that are repressed upon depletion of glucose. Each of the genes 
profiled here contains one or more RAP1 -binding motifs upstream of its promoter. RAP1 is a transcrip- 
tional regulator of most ribosomai proteins. 



required for fabricating and using DNA 
microarrays (9) consists of components 
that were chosen for their modest cost and 
simplicity. It was feasible for a small group 
to accomplish the amplification of more 
than 6000 genes in about 4 months and, 
once the amplified gene sequences were in 
hand, only 2 days were required to print a 
set of 110 microarrays of 6400 elements 
each. Probe preparation, hybridization, 
and fluorescent imaging are also simple 
procedures. Even conceptually simple ex- 
periments, as we described here, can yield 
vast amounts of information. The value of 
the information from each experiment of 
this kind will progressively increase as 
more is learned about the functions of 
each gene and as additional experiments 
define the global changes in gene expres- 
sion in diverse other natural processes and 
genetic perturbations. Perhaps the greatest 
challenge now is to develop efficient 
methods for organizing, distributing, inter- 
preting, and extracting insights from the 
large volumes of data these experiments 
will provide. 
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the pory-t-fysine surface was blocked by a 15- min 
incubation in a solution of 70 mM succinic anhydride 
dissolved in a solution consisting of 315 mJ of 1- 
methyl-2-pyrrofidinone (Aldrich) and 35 ml of 1 M 
boric acid (pH 8.0). Directly after the blocking reac- 
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